Multimodal - AI Present

中

OpenAI, Multimodal, Image Generation, GPT-Image-2

GPT-Image-2 Breakthrough: When AI Learned to "Think" Before Drawing

OpenAI releases GPT-Image-2 (ChatGPT Images 2.0), the first image model with reasoning capabilities.

AI, Alibaba, Digital Human, Multimodal

Alibaba's «Little Dimple» Debuts: I Dug Into the Details So You Don't Have To

Alibaba's April 22 launch of their digital human «Little Dimple» (Xiaojiuwo) promises a «Hello World

Anthropic, Multimodal, Claude Opus 4.7, Coding

Claude Opus 4.7 Arrives: 13% Coding Boost with Quiet Confidence

Anthropic released Claude Opus 4.7 on April 16, with a 13% boost in coding benchmarks and support fo

Multimodal, AI Model, Meta, Muse Spark

Meta Muse Spark: Early-Stage But Architecturally Interesting Multimodal Model

Meta releases Muse Spark multimodal model. While capabilities remain early-stage, the architecture r

Multimodal, Edge AI, SenseTime Junying, Sage Model, Automotive AI

SenseTime Sage: Fitting a 32B Multimodal Model Into Your Car

SenseTime's Junying releases Sage, an edge-deployed multimodal agent model with 32B parameters runni

Multimodal, Edge AI, SenseTime Junying, Sage Model, Automotive AI

SenseTime Sage: Fitting a 32B Multimodal Model Into Your Car

SenseTime's Junying releases Sage, an edge-deployed multimodal agent model with 32B parameters runni