Unknown osv · PYSEC-2025-63

PYSEC-2025-63

Published Mar 19, 2025

vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. When vLLM is configured to use Mooncake, unsafe deserialization exposed directly over ZMQ/TCP on all network interfaces will allow attackers to execute remote code on distributed hosts. This is a remote code execution vulnerability impacting any deployments using Mooncake to distribute KV across distributed hosts. This vulnerability is fixed in 0.8.0.

Remote Code Execution

Affected AI Products

vllm

View original source

Get the weekly digest. Every Monday: top AI security stories of the week. Free.