[AI Industry Weekly] Grok Launches Image Editing, Claude 3.7 Takes On Pokémon

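This roundup covers several recent generative AI developments, from Grok's image editing feature and Tencent's 3D generation models to game-playing tests with Claude 3.7 Sonnet, all of which show how quickly the technology is advancing across fields.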

Grok Launches Image Editing

Goodbye Photoshop

Grok 3 can now edit any image in seconds.

13 Wild Examples you don't want to miss: pic.twitter.com/s6y1yi4xov
— Poonam Soni (@CodeByPoonam) March 21, 2025

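Grok recently added an image editing feature. It can edit AI-generated images, color hand-drawn sketches, and retouch existing images. According to hands-on reports, line art is somewhat distorted during coloring and the colors come out rather pale, though repeated attempts may produce better results.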

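It does not seem to understand requests to clear the dialogue bubbles from comic panels, however, and the results are, to put it kindly, something to behold.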

Grok launched an "Edit Image" feature, and when asked to help clear out dialogue bubbles from comics, the results were surprising. pic.twitter.com/xV9elKVgmG
— 吹著魔笛的浮士德 | QBNews (@h98569856) March 24, 2025

Several 3D Generation AI Tools Debut

Run Cube 3D on Windows, Linux, and even Mac!

This is the first 3D generation model that works out of the box on ALL platforms.

I wrote a simple gradio app and a 1-click launcher.

For the first time, Mac users can generate 3D objects with an AI model. Watch the video. https://t.co/OVVWWdLhpj pic.twitter.com/xmfW78boae
— cocktail peanut (@cocktailpeanut) March 21, 2025

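Cube 3D, a cross-platform 3D generation model, now runs on Windows, Linux, and Mac. Separately, Tencent's open-source 3D generation model Hunyuan3D 2.0 and its multi-view variant Hunyuan3D 2.0 MV are now natively supported in ComfyUI, which broadens the options for 3D content creation.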

We are happy to announce that ComfyUI now natively supports @TXhunyuan Hunyuan3D 2.0 and Hunyuan3D 2.0 MV (Multi-View) model series!

3 workflows to get started:

🔹Hunyuan3D-2 mv: multi-view to 3D
🔹Hunyuan3D-2 mv Turbo: accelerated multi-view
🔹Hunyuan3D-2: single image to 3D pic.twitter.com/o8bYPout1v
— ComfyUI (@ComfyUI) March 22, 2025

AI-Assisted Game Development and Video Generation

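PlayCanvas's editor-mcp-server lets Anthropic's Claude automate operations in the PlayCanvas Editor, including creating, modifying, and deleting entities, managing components, and editing scripts.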

Built an open source MCP server for that allows @AnthropicAI Claude to control the @PlayCanvas Editor. Just gave it the prompt "Build me a fun FPS level" and it just did it! This is a game-changer! 🤯 pic.twitter.com/fUk6IlwL1E
— Will Eastcott (@willeastcott) March 21, 2025

https://github.com/playcanvas/editor-mcp-server

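On the video generation side, Vidu updated its multi-reference consistency feature. The update improves the stability of generated videos and accepts up to 7 reference images, giving creators considerably more flexibility.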

Leading Vidu isn’t just about innovation—it’s about having fun while pushing creative boundaries! 🚀 From wild AI experiments to late-night idea sparks, every step is an adventure. Who else loves breaking the limits of creativity? Let’s make something awesome. 🎥✨ #ViduAI… pic.twitter.com/nyTrpHGyir
— Evan Liao (@evanLiaoQ) March 21, 2025

New Applications and Tests of Language Models

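A recent dataset enrichment experiment used the AI framework BAML with two large language models, Gemma-3 and gpt-4o-mini. According to the write-up, Gemma-3 performed particularly well on fine-tuning and world-knowledge tasks.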

https://thedataquarry.com/blog/using-llms-to-enrich-datasets

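Separately, Anthropic's latest model, Claude 3.7 Sonnet, was tested on its ability to play the classic game Pokémon. The results show that the model still cannot fully master the game. In particular, it struggles to handle incorrect information in its knowledge base, which stalls its progress through the game.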

Google Begins Rolling Out Project Astra Features

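The Project Astra features that Google announced at MWC have begun rolling out to Gemini Live on Android. Some Android users can now share their phone screen through Gemini Live or show their surroundings in real time through the camera, which makes AI interaction on mobile devices more capable.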

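This week's developments show generative AI continuing to advance in image processing, 3D modeling, game development, and mobile applications, opening up more options for content creators and developers.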

References

【生成AIニュース+】『Grokのedit image』『Cube 3D』『editor-mcp-server』『ComfyUIでHunyuan3D 2.0』『ViduのReference』『BAMLでのGemma3』『Claude 3.7 Sonnetがポケモンをプレイ』『GoogleのProject Astra』

吹著魔笛的浮士德
