LMSYS Blog
lmsys.org/
SGLang Diffusion: Accelerating Video and Image Generation
<p>We are excited to introduce SGLang Diffusion, which brings SGLang's state-of-the-art performance to accelerate image and video generation for diffusion mo...
No Free Lunch: Deconstruct Efficient Attention with MiniMax M2
<p>We are excited to announce day-one support for the new flagship model, MiniMax M2, on SGLang. The MiniMax M2 redefines efficiency for agents: it is a comp...
Optimizing GPT-OSS on NVIDIA DGX Spark: Getting the Most Out of Your Spark
<p>We’ve got some exciting updates about the <strong>NVIDIA DGX Spark</strong>! In the week following the official launch, we collaborated closely with NVIDI...
SGLang-Jax: An Open-Source Solution for Native TPU Inference
<p>We're excited to introduce SGLang-Jax, a state-of-the-art open-source inference engine built entirely on Jax and XLA. It leverages SGLang's high-performan...