Core idea: optimize one coordinate at a time. Why it matters: it is the cleanest possible derivative-free baseline. Main weakness: axis directions are fixed, so diagonal valleys can trap it badly.
核心思想是逐坐标优化。它的重要性在于提供了最原始的无导数起点;但方向固定在坐标轴上,所以一旦谷底斜着走,它就会很笨。
Core idea: still use line searches, but let directions evolve from actual progress. Improvement over cyclic: directions can gradually align with the valley. New issue: full line searches remain expensive, especially in high dimensions.
它保留线搜索的精度,但方向不再死板,而是逐步学习函数地形;问题是线搜索本身仍然贵。
Core idea: replace full line searches with cheap ±α probes. Improvement over Powell: much cheaper per move. New issue: checking all 2n coordinate directions can itself become wasteful.
它用“试一小步看看有没有变好”代替完整线搜索,更像试探式走路;但在高维时,2n 个方向依然不少。
Core idea: use any positive spanning set, not necessarily all ± coordinate directions. Improvement over Hooke–Jeeves: only enough directions to guarantee a descent opportunity. New issue: it is still a single-point local method.
它把“哪些方向足够用”这件事抽象成正生成集,因此比单纯坐标探测更一般也更省;但本质上还是局部搜索。
Core idea: carry a whole simplex of n+1 points and deform its shape. Improvement over point-based methods: geometry now comes from a moving body, not one point plus test directions. New issue: still local, and theoretical guarantees are weaker than the neat positive-spanning framework.
它不再只盯住一个点,而是让一个会变形的几何体往低处滑,直观性很强;但它仍然可能困在局部极小值。
Core idea: partition the whole domain and select potentially optimal rectangles using lower-bound logic. Why it is different: this is the chapter’s global method. Tradeoff: broader exploration, slower fine local polishing.
DIRECT 不再问“当前点附近怎么走”,而是问“整个定义域里哪些区域最值得继续分裂”。这是本章从局部思维转向全局思维的关键一步。
| Method | Main move | Typical cost | Local / Global | Best for |
|---|---|---|---|---|
| Cyclic 循环坐标 | Axis-by-axis line search | 1 line search each coordinate move | Local | Teaching intuition, very simple structure |
| Powell Powell | Adaptive direction line searches | Many line searches per cycle | Local | Smooth valley-like problems without derivatives |
| Hooke–Jeeves | ±α local probes | About 2n evaluations per exploratory step | Local | Cheap function evaluations, simple implementation |
| Pattern Search 模式搜索 | Positive spanning directions | At least n+1 directions | Local | More principled direct search design |
| Nelder–Mead | Simplex deformation | n+1 vertices updated geometrically | Local | Low-dimensional black-box tuning |
| DIRECT | Rectangle partition + lower bound | Global partition budget grows over time | Global | Unknown landscape, many local minima |