跪拜 Guibai
← All articles
Backend

WorkBuddy Hands-On: Turning a Local AI Workbench Into a Real Office Tool

By 倔强的石头_ ·
Read original on juejin.cn ↗ Google Translate ↗ Alt translation

WorkBuddy signals a shift from chat-first AI to task-first AI workbenches — a model that better matches how knowledge workers actually operate. For Western developers building productivity tools, this layered architecture (input → expert → skill → automation) offers a replicable pattern for turning LLMs into reliable office assistants, not just conversational toys.

Summary

Most AI products are built for conversation, but daily office work demands structured outputs: weekly reports, meeting checklists, task breakdowns, and draft notices. WorkBuddy tackles this by organizing work into four distinct layers — Claw for basic input, an Expert Center for role-based prompts, a Skill Center for extending capabilities, and Automation Templates for repetitive workflows.

A recent hands-on integration connected WorkBuddy to Lanyun MaaS using the MiniMax-M2.5 model, chosen for its strong throughput (123.26 tokens/s), low latency (0.19s), and 100% reliability over six hours. The test chain ran through custom model configuration, basic task input in Claw, expert role assignment, skill library exploration, and automation template usage.

The results were practical: Claw returned structured answers focused on office tasks rather than small talk. The Expert Center separated roles by function — summarizing, reporting, copywriting — and the Automation page offered ready-made templates for weekly reports, meeting prep, and daily news. A boundary test showed the system refused to fabricate private data, instead asking clarifying questions about permissions. A more complex test generating a 5-page PPT draft from scratch confirmed the workbench can build structure and compress content, though output quality depends on input specificity.

Takeaways
WorkBuddy organizes AI interaction into four layers: Claw (basic input), Expert Center (role-based prompts), Skill Center (extensions), and Automation Templates (repetitive workflows).
The MiniMax-M2.5 model, accessed via Lanyun MaaS, delivers 123.26 tokens/s throughput, 0.19s latency, and 100% reliability over six hours — metrics suited for high-frequency office tasks.
WorkBuddy uses a credit-based consumption model, making model throughput and cost per token critical for sustained daily use.
Claw returned structured, office-focused answers (summaries, to-dos, notifications) rather than conversational small talk.
The Expert Center separates roles by function — design, engineering, marketing, product, project management — allowing task-specific prompt optimization.
Automation templates cover weekly reports, meeting preparation, daily news, birthday reminders, and interview prep, reducing repetitive prompt engineering.
A boundary test showed WorkBuddy refused to fabricate private data (phone numbers, login info) and instead asked clarifying questions about member identity, scenario, and permissions.
A complex test generating a 5-page PPT draft from scratch confirmed the system can build structure and compress content, but output richness depends on input specificity.
Common issues include model not responding (check API address, model name, API key) and unstable output style (make task descriptions more specific).
Conclusions

The shift from chat-first to task-first AI interfaces is a meaningful architectural choice: it acknowledges that most office work is structured, repetitive, and role-specific, not open-ended conversation.

WorkBuddy's layered design (input → expert → skill → automation) mirrors how knowledge workers actually think: they don't want one AI to do everything; they want specialized agents for specific workflows.

The credit-based consumption model introduces a real-world constraint that many Western developers overlook: model performance isn't just about accuracy, but about throughput, latency, and cost per token under sustained load.

The boundary test result is notable — many AI tools hallucinate private data when asked. WorkBuddy's refusal to fabricate and its request for permission context suggests a deliberate safety design, not just model behavior.

The PPT generation test reveals a limitation: the system can structure output but struggles to fill in rich content when given vague instructions. This reinforces that AI workbenches still depend on human clarity — they amplify skill, not replace it.

WorkBuddy's approach could be generalized: any productivity tool that wants to move from 'AI chat' to 'AI assistant' should consider separating role, skill, and automation layers rather than relying on a single prompt box.

Concepts & terms
WorkBuddy
A local AI workbench that organizes tasks into four layers: Claw (basic input), Expert Center (role-based prompts), Skill Center (extensions), and Automation Templates (repetitive workflows). It uses a credit-based consumption model and supports custom model integration via OpenAI-compatible APIs.
Lanyun MaaS
A Chinese Model-as-a-Service platform that provides access to various LLMs via an OpenAI-compatible API. It offers models like MiniMax-M2.5 with measured throughput, latency, and reliability metrics, and uses a credit-based billing system.
MiniMax-M2.5
A large language model from MiniMax, accessed via Lanyun MaaS. It offers 123.26 tokens/s throughput, 0.19s latency, and 100% reliability over six hours, making it suitable for high-frequency office tasks that require fast, stable responses.
Claw
The basic input module in WorkBuddy where users enter tasks. It serves as the primary interface for connectivity verification and basic task execution, returning structured answers focused on office workflows rather than conversational small talk.
Expert Center
A module in WorkBuddy that provides pre-built expert roles (design, engineering, marketing, product, project management) and allows users to create custom experts. It separates prompts by function, enabling task-specific optimization.
Skill Center
A module in WorkBuddy that extends the workbench's capabilities beyond model generation. It includes installed skills and a recommended skill library for actions like browser operations, document processing, information extraction, and process assistance.
Automation Templates
Pre-built workflow templates in WorkBuddy for common office tasks such as weekly reports, meeting preparation, daily news push, birthday reminders, and interview prep. They fix repetitive processes so users don't need to re-prompt for standard tasks.
Source: juejin.cn ↗ Google Translate ↗ Backup ↗