r/LocalLLaMA • 90 days ago

Mistral releases 'Medium 3.5' model and launches cloud coding agents

IMP

8/10

Key summary

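Mistral has released Mistral Medium 3.5, a 128B dense flagship model that combines instruction-following, reasoning, and coding. Alongside it, the coding agent Vibe is extended to the cloud with support for asynchronous, parallel tasks, and Le Chat gains a new Work mode for complex multi-step tasks.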

Translated text

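Remote agents in Vibe. Powered by Mistral Medium 3.5.

Introducing Mistral Medium 3.5, remote coding agents in Vibe, and a new Work mode in Le Chat for complex tasks.

Until now, coding agents have mostly run on your laptop. Starting today, we are moving them to the cloud, where they run independently and in parallel and notify you when they finish. You can start cloud agents from the Mistral Vibe CLI or directly in Le Chat, delegating a coding task without leaving the conversation.

What makes this possible is Mistral Medium 3.5, released in public preview. The new default model in Mistral Vibe and Le Chat, it is built to run for long stretches on coding and productivity work. The new Work mode in Le Chat (Preview) extends this with a powerful agent for complex multi-step tasks such as research, analysis, and cross-tool actions.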

Highlights

Mistral Medium 3.5: a new flagship model that merges instruction-following, reasoning, and coding into a single 128B dense model. Released as open weights under a modified MIT license. Strong real-world performance at a size that can be self-hosted on as few as four GPUs.
Mistral Vibe remote agents for async coding: sessions run in the cloud, can be spawned from the CLI or Le Chat, and a local CLI session can be teleported up to the cloud.
Start Mistral Vibe coding tasks in Le Chat: sessions run on the same remote runtime and keep going while you step away.
Work mode in Le Chat: runs on a new agent, powered by Mistral Medium 3.5, that works through multi-step tasks, calling tools in parallel until the job is done.

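Mistral Medium 3.5

Mistral Medium 3.5 is our first flagship merged model, available in public preview. It is a dense 128B model with a 256k context window that handles instruction-following, reasoning, and coding in a single set of weights. It can be self-hosted on as few as four GPUs and performs strongly in real-world use. Reasoning effort is now configurable per request, so the same model can answer a quick chat reply or work through a complex agentic run. We trained the vision encoder from scratch to handle variable image sizes and aspect ratios.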
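
As a rough check on the four-GPU claim, the weight footprint can be estimated from the parameter count alone. The sketch below is back-of-the-envelope arithmetic, not a deployment guide: the 80 GB per-GPU figure and the listed precisions are assumptions that the announcement does not state, and the estimate covers weights only, ignoring KV cache and activations.

```python
# Figures taken from the announcement.
PARAMS_BILLION = 128   # 128B dense model
NUM_GPUS = 4           # "as few as four GPUs"

# ASSUMPTION: 80 GB of memory per GPU; the announcement names no GPU model.
GPU_MEMORY_GB = 80

# ASSUMPTION: common weight precisions, in bytes per parameter.
PRECISIONS = {"bf16": 2.0, "fp8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billion: float, bytes_per_param: float) -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    # (params_billion * 1e9 params) * bytes_per_param / 1e9 bytes per GB
    return params_billion * bytes_per_param


def fits(precision: str) -> bool:
    """True if the weights alone fit in the combined GPU memory."""
    needed = weight_memory_gb(PARAMS_BILLION, PRECISIONS[precision])
    return needed < NUM_GPUS * GPU_MEMORY_GB


for name, bytes_per_param in PRECISIONS.items():
    needed = weight_memory_gb(PARAMS_BILLION, bytes_per_param)
    total = NUM_GPUS * GPU_MEMORY_GB
    print(f"{name}: {needed:.0f} GB of weights vs {total} GB total, fits: {fits(name)}")
```

Under these assumptions, bf16 weights take 256 GB of the 320 GB available, leaving 64 GB for everything else; lower-precision weights would leave more room for long contexts.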

Mistral Medium 3.5 scores 77.6% on SWE-Bench Verified, ahead of Devstral 2 and models like Qwen3.5 397B A17B. It also has strong agentic capabilities and scores 91.4 on τ³-Telecom. The model was built for long-horizon tasks, calling multiple tools reliably and producing structured output that downstream code can consume. It is the model that made async cloud agents in Vibe practical to ship.

Mistral Medium 3.5 becomes the default model in Le Chat. It also replaces Devstral 2 in our coding agent, Vibe CLI.

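Vibe remote agents

From today, coding sessions can work through long tasks while you are away. Many sessions can run in parallel, so you stop being the bottleneck on every step the agent takes. You can start cloud agents from the Mistral Vibe CLI or from Le Chat. While they run, you can inspect what the agent is doing, with file diffs, tool calls, progress states, and questions surfaced in real time.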
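
The "start several sessions, get notified as each finishes" pattern described above can be illustrated with plain `asyncio`. This is only a conceptual sketch: `run_remote_session` is a hypothetical stand-in that sleeps instead of doing real work, and nothing here reflects the actual Vibe CLI or any Mistral API.

```python
import asyncio


async def run_remote_session(task: str, seconds: float) -> str:
    """HYPOTHETICAL stand-in for a cloud coding session; it only sleeps."""
    await asyncio.sleep(seconds)
    return f"PR ready: {task}"


async def launch_all(tasks: dict[str, float]) -> list[str]:
    """Start every session at once and collect notifications in completion order."""
    pending = [
        asyncio.create_task(run_remote_session(name, seconds))
        for name, seconds in tasks.items()
    ]
    notifications = []
    # as_completed yields awaitables in the order the sessions finish,
    # not the order they were started.
    for finished in asyncio.as_completed(pending):
        notifications.append(await finished)
    return notifications


if __name__ == "__main__":
    # Task names are examples of well-defined work; durations are made up.
    demo = {"module refactor": 0.3, "test generation": 0.1, "dependency upgrade": 0.2}
    for note in asyncio.run(launch_all(demo)):
        print(note)
```

Because the sessions run concurrently, the total wall-clock time is close to the longest single session rather than the sum of all of them, which is the sense in which the developer stops being the bottleneck.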

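Ongoing local CLI sessions can be teleported up to the cloud when you want to leave them running, with session history, task state, and approvals carrying across. Vibe sits between the systems engineering teams already use, with humans in the loop wherever they are needed. It plugs into GitHub for code and pull requests, Linear and Jira for issue tracking, Sentry for incident management, and apps like Slack or Teams for reporting.

Each coding session runs in an isolated sandbox, including broad edits and installs. When the work is done, the agent opens a pull request on GitHub and notifies you, so you review the final result instead of every keystroke that produced it. This fits the high-volume, well-defined work that takes a developer's time without taking their judgment.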

Original text (English)

Remote agents in Vibe. Powered by Mistral Medium 3.5.

Introducing Mistral Medium 3.5, remote coding agents in Vibe, plus new Work mode in Le Chat for complex tasks.

Coding agents have mostly lived on your laptop. Today we're moving them to the cloud, where they run on their own, in parallel, and notify you when they're done. You can start them from the Mistral Vibe CLI or directly in Le Chat, offloading a coding task without leaving the conversation.

Powering this is Mistral Medium 3.5 in public preview, our new default model in Mistral Vibe and Le Chat, built to run for long stretches on coding and productivity work. The new Work mode in Le Chat (Preview) extends this with a powerful agent for complex, multi-step tasks like research, analysis, and cross-tool actions.

Highlights

Mistral Medium 3.5, a new flagship model that merges instruction-following, reasoning, and coding into a single 128B dense model. Released as open weights, under a modified MIT license. Strong real-world performance at a size that runs self-hosted on as few as four GPUs.
Mistral Vibe remote agents for async coding: sessions run in the cloud, can be spawned from the CLI or Le Chat, and a local CLI session can be teleported up to the cloud.
Start Mistral Vibe coding tasks in Le Chat. Sessions run on the same remote runtime and keep going while you step away.
Work mode in Le Chat runs on a new agent, powered by Mistral Medium 3.5, that works through multi-step tasks, calling tools in parallel until the job is done.

Mistral Medium 3.5

Mistral Medium 3.5 is our first flagship merged model, available in public preview.
It is a dense 128B model with a 256k context window, handling instruction-following, reasoning, and coding in a single set of weights. It performs strongly in real-world use, with self-hosting possible on as few as four GPUs. Reasoning effort is now configurable per request, so the same model can answer a quick chat reply or work through a complex agentic run. We trained the vision encoder from scratch to handle variable image sizes and aspect ratios.

Mistral Medium 3.5 scores 77.6% on SWE-Bench Verified, ahead of Devstral 2 and models like Qwen3.5 397B A17B. It also has strong agentic capabilities and scores 91.4 on τ³-Telecom. The model was built for long-horizon tasks, calling multiple tools reliably, and producing structured output that downstream code can consume. It is the model that made async cloud agents in Vibe practical to ship.

Mistral Medium 3.5 becomes the default model in Le Chat. It also replaces Devstral 2 in our coding agent, Vibe CLI.

Vibe remote agents

From today, coding sessions can work through long tasks while you’re away. Many can run in parallel, and you stop being the bottleneck on every step the agent takes. You can start the cloud agents from the Mistral Vibe CLI or from Le Chat. While they run, you can inspect what the agent is doing, with file diffs, tool calls, progress states, and questions surfaced as you go.

Ongoing local CLI sessions can be teleported up to the cloud when you want to leave them running, with session history, task state, and approvals carrying across. Vibe sits between the systems engineering teams already use, with humans in the loop wherever they're needed. It plugs into GitHub for code and pull requests, Linear and Jira for issues, Sentry for incidents, and apps like Slack or Teams for reporting.

Each coding session runs in an isolated sandbox, including broad edits and installs.
When the work is done, the agent can open a pull request on GitHub and notify you, so you review the result instead of every keystroke that produced it. It fits the high-volume, well-defined work that takes a developer's time without taking their judgment: module refactors, test generation, dependency upgrades, CI investigations, as well as bug fixes.

We use Workflows orchestrated in Mistral Studio to bring Mistral Vibe into Le Chat. We originally built this for our own in-house coding environment, then for our enterprise customers. Today the capability opens up to everyone, who can now launch coding tasks from the web. And without being tied to a local terminal, a developer can run several in parallel. You can start coding sessions directly in Le Chat, so a task described in chat runs on the same remote runtime as the CLI and the web, and comes back later as a finished branch or a draft PR.

New Work mode in Le Chat (Preview)

Work mode is a powerful new agentic mode for complex tasks in Le Chat, powered by a new harness and Mistral Medium 3.5. The agent becomes the execution backend for the assistant itself, so Le Chat can read and write, use several tools at once, and work through multi-step projects until it completes what you’ve asked.

Here’s what Work mode enables you to do today.

Cross-tool workflows: catch up across email, messages, and calendar in a single run; prepare for a meeting with attendee context, latest news, and talking points pulled from your sources.
Research and synthesis: dive into a topic across the web, internal docs, and connected tools, then produce a structured brief or report you can edit before exporting or sending.
Triage your inbox and draft replies; create issues in Jira from your team and customer discussions; send a summary to your team on Slack.

Sessions persist longer than a typical chat reply, so an agent can keep going across many turns, through trial-and-error, and through to completion.
In Work mode, connectors are on by default rather than chosen manually, which lets the agent reach into documents, mailboxes, calendars, and other systems for the rich context it needs to take correct action. Every action the agent takes is visible: you see each tool call and the thinking rationale. Le Chat will ask for explicit approval, based on your permissions, before proceeding with sensitive tasks like sending a message, writing a document, or modifying data.

Get started

Mistral Medium 3.5 is available today in Mistral Vibe and Le Chat, and powers remote coding agents and Work mode in Le Chat on the Pro, Team, and Enterprise plans. Through API, it’s priced at $1.5 per million input tokens and $7.5 per million output tokens. Open weights are on Hugging Face under a modified MIT license. It is also available for prototyping, hosted on NVIDIA GPU-accelerated endpoints on build.nvidia.com and as a scalable containerized inference microservice, NVIDIA NIM.

Build the future of agentic systems with us. We're hiring across research, engineering, and product to push agentic systems further. See our open roles.
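
To make the API pricing quoted in the original text concrete, the per-million-token rates can be turned into a per-request estimate. The sketch below uses only the two prices from the announcement; the token counts in the example are a made-up workload, and the function name is illustrative rather than part of any SDK.

```python
# Prices from the announcement, in USD per million tokens.
INPUT_USD_PER_MILLION = 1.5
OUTPUT_USD_PER_MILLION = 7.5


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated API cost in USD for one request or one batch of requests."""
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


# HYPOTHETICAL workload: a long agentic run that reads 200k tokens
# of context and writes 20k tokens of output.
print(f"${estimate_cost_usd(200_000, 20_000):.2f}")
```

Output tokens cost five times as much as input tokens at these rates, so long generated outputs affect the bill more than large contexts of the same size.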

Mistral open-source-models coding-agents cloud