OpenAI, Multimodal, Image Generation, GPT-Image-2 GPT-Image-2 Breakthrough: When AI Learned to "Think" Before Drawing OpenAI releases GPT-Image-2 (ChatGPT Images 2.0), the first image model with reasoning capabilities.
AI, Alibaba, Digital Human, Multimodal Alibaba's «Little Dimple» Debuts: I Dug Into the Details So You Don't Have To Alibaba's April 22 launch of their digital human «Little Dimple» (Xiaojiuwo) promises a «Hello World
Anthropic, Multimodal, Claude Opus 4.7, Coding Claude Opus 4.7 Arrives: 13% Coding Boost with Quiet Confidence Anthropic released Claude Opus 4.7 on April 16, with a 13% boost in coding benchmarks and support fo
Multimodal, AI Model, Meta, Muse Spark Meta Muse Spark: Early-Stage But Architecturally Interesting Multimodal Model Meta releases Muse Spark multimodal model. While capabilities remain early-stage, the architecture r
Multimodal, Edge AI, SenseTime Junying, Sage Model, Automotive AI SenseTime Sage: Fitting a 32B Multimodal Model Into Your Car SenseTime's Junying releases Sage, an edge-deployed multimodal agent model with 32B parameters runni