LMSYS Blog

lmsys.org/
27 articles
Last updated: Nov 8, 05:01
SGLang Diffusion: Accelerating Video and Image Generation

We are excited to introduce SGLang Diffusion, which brings SGLang's state-of-the-art performance to accelerate image and video generation for diffusion mo...

LMSYS Blog
library tool
No Free Lunch: Deconstruct Efficient Attention with MiniMax M2

We are excited to announce day-one support for the new flagship model, MiniMax M2, on SGLang. The MiniMax M2 redefines efficiency for agents: it is a comp...

LMSYS Blog
api tool
Optimizing GPT-OSS on NVIDIA DGX Spark: Getting the Most Out of Your Spark

We’ve got some exciting updates about the NVIDIA DGX Spark! In the week following the official launch, we collaborated closely with NVIDI...

LMSYS Blog
api tool
SGLang-Jax: An Open-Source Solution for Native TPU Inference

We're excited to introduce SGLang-Jax, a state-of-the-art open-source inference engine built entirely on Jax and XLA. It leverages SGLang's high-performan...

LMSYS Blog
library tool