DeepTechNews.Asia

GRZO: Group-Relative Zeroth-Order Optimization for Large Language Model Fine-Tuning

Industrial AI

Summary

arXiv:2606.02857v1 Announce Type: new Abstract: Zeroth-order (ZO) optimization is a memory-efficient alternative to backpropagation for fine-tuning large language models, but its deployment is limited by the high variance of gradient estimation. We propose GRZO, a Group-Relative Zeroth-Order optimizer that draws one pseudo-independent perturbation per mini-batch example and aggregates the per-example losses through group-relative normalization, raising the effective gradient-direction count from one to the batch size at no additional forward cost while preserving inference-level memory. We prove that GRZO is directionally unbiased with variance shrinking proportionally to the batch size, yielding a tighter nonconvex convergence bound than MeZO.

Why It Matters

This Industrial AI development deepens the link between AI compute and industrial productivity. For Asia, it is a signal worth tracking: it shapes who supplies, who scales, and who sets the standard over the next five years.

Key Facts

  • SectorIndustrial AI
  • Market
  • ImpactLow (42/100)
  • SignalResearch

Original Sources

arXiv AI / Machine Learning ↗ https://arxiv.org/abs/2606.02857

Related Stories