- LLM decoding algorithm: Top-W - Geometry-Aware Decoding with Wasserstein-Regularized Truncation and Mass Penalties for Large Language Models