
Chaos erupts as cyberattack disrupts learning platform Canvas amid finals
Across the country, schools and colleges postpone year-end tests.

Across the country, schools and colleges postpone year-end tests.
The Small Brief is an initiative partnering three advertising industry leaders to create AI-generated advertisements...

Zyphra releases ZAYA1-8B, a reasoning Mixture of Experts model with only 760M active parameters that outperforms much...

A celebration of the tweaks and customizations that make life easier at the CLI.

Daemon Tools users: It's time to check your machines for stealthy infections, stat.

Car manufacturers are exploring AI and large language models to accelerate vehicle design and development processes,...

Amazon SageMaker AI has introduced agentic fine-tuning capabilities to help developers customize language models, with...

Amid falling revenue and store closures, GameStop wants to buy the much larger eBay.

This article provides a developer's guide to systematic prompting techniques for large language models, focusing on...
Xiaomi released MiMo-V2.5-Pro, an open-weight model that nearly matches Anthropic's Claude Opus 4.6 on coding...

The article discusses tokenization drift, a subtle but critical issue where AI models can degrade in performance due to...

Analysis of OpenAI's GPT-5.5 and Anthropic's Opus 4.7 on the ARC-AGI-3 benchmark reveals three systematic reasoning...

A hands-on tutorial guide for post-training large language models using the TRL library, covering four key techniques:...

Qwen AI has released Qwen-Scope, an open-source Sparse Autoencoder suite that provides tools to interpret and utilize...

Moonshot AI has open-sourced FlashKDA, a high-performance implementation of Kimi Delta Attention optimized for the...

Tencent released a compact 440 MB AI translation model as an open-weight model that supports 33 languages and runs...

IBM has released two versions of the Granite Speech 4.1 2B model for automatic speech recognition (ASR), featuring both...

The QwenLM team released FlashQLA, a high-performance kernel library that accelerates linear attention mechanisms,...

SenseTime, a sanctioned Chinese AI firm, has released a new image generation model optimized to run on Chinese-made...