Marvis Turns Your Phone Into a Second Computer — And It Actually Understands Your Files
This is Cang He's 553rd original article!
Hello everyone, I'm Cang He.
Recently, I've been spending half my time on the road — on high-speed trains, in hotels, at conferences.
Often, I have my phone at hand, but critical files are on my home computer.
Before, if I needed to change something and only had my phone, I was stuck.
Until I used Marvis — my phone became a second computer.
Now, when I'm out and want Codex to run a task, I pull out my phone and connect to my Mac mini at home.
I use voice input on my phone to issue commands, and it executes them directly on the computer. Above the dialog box, I can see Codex's real-time progress.
Recently, Codex's Cowart infinite canvas plugin has been popular, so I had it install and test it on my remote computer.
On the left is the real-time desktop of the Mac mini; on the right is the Marvis dialog box. Just send commands — the experience feels no different from sitting in front of the computer.
Sometimes the computer is locked — no problem. Enter the password on the phone to wake it up and continue working.
It feels amazing — you're out, but the computer never leaves you.
Footage captured outside can also be sent back directly.
I had Marvis sync the phone's material to the corresponding folder on the Mac mini, ready for video editing without needing to transfer again.
I also had it classify and organize the files — much more convenient than doing it manually.
Before, the first thing after a business trip was to transfer material; now I can start editing as soon as I land.
Marvis also has built-in skills. I tried one — asked it to make a product promotion PPT.
Wow, it actually made one, and the result was pretty good.
You can find the skills you need just like browsing Xiaohongshu.
Android connecting to Mac, iOS connecting to Windows — no problem. It's quite comprehensive.
Recently, my Windows computer was a bit slow, so I had Marvis do a full health check.
It scanned hardware configuration, disk space, cleanable junk, system performance, battery health, analyzed my usage habits, and gave suggestions and an action plan.
After optimizing according to the suggestions, it became much faster.
I also had it clean up the desktop.
Content creators know how messy desktops can get — no explanation needed. Send a message from the phone, and it organizes everything by category. Looks much cleaner.
Yesterday, a friend asked if I had a demo video for GEO. I remembered it was on the computer, so I had Marvis find it.
In less than a minute, it sent the file directly.
Before, I would have had to remote into the computer and search for ages.
Now, out and about, one sentence does the job.
Speaking of which, there's one thing about Marvis that I find particularly special.
Marvis is not just a remote control — it actively understands the content on your computer.
After authorization, it analyzes existing files.
Images can be categorized into people, landscapes, posters, locations.
It can also recognize faces.
Local files can be sorted into types like courseware, spreadsheets, contracts, PDFs.
You can also directly view all files on the computer.
My first thought was, "How is privacy guaranteed?"
Later I learned that file recognition uses on-device local models — runs on your own computer. When building the knowledge base, all information stays local; it only reads after authorization. The cloud does not save user information; data is deleted after each conversation ends.
I have a lot of important materials on my computer, so this is crucial for me.
The local knowledge base transforms the computer from "you remember where things are" to "you describe what you want."
One thing to note: you need to describe what you want clearly, otherwise Marvis might not find it.
Marvis also has a local mode for scenarios with higher privacy requirements. All computation runs locally; data never leaves the device.
However, local mode has hardware requirements: processor with 16+ cores, 32 GB RAM, 16+ GB VRAM, driver version 535.0+.
Honestly, writing this, I feel a bit emotional.
Before, many things had to wait until I got back. Now, I just pull out my phone and connect directly to my six little horses.
Inspiration strikes anytime, anywhere — vibe coding on the go.
This year, AI agents are everywhere; many can operate your computer.
But most agents are tied to a single device — you have to sit in front of the computer to use them.
What sets Marvis apart is that it completely decouples "where you are" from "where the computer is."
Traditional remote control is "I'm looking at another screen." Now it's "I'm talking to an agent that understands my computer."
With the former, you're still operating. With the latter, you only need to express intent.
When the walls between devices are broken down, you gradually forget about buttons and only remember what you want.
Give Marvis a try — carry your home computer with you.
What do you think AI agents should help you with the most? Discuss in the comments.