Research & Papers

Latest AI research

The MIT-IBM Computing Research Lab launches to shape the future of AI and quantum computing
Research & Papers

MIT and IBM have launched a new computing research lab focused on the convergence of AI, algorithms, and quantum...

MIT News AI
Meta FAIR Releases NeuralSet: A Python Package for Neuro-AI That Supports fMRI, M/EEG, Spikes, and HuggingFace Embeddings
Research & Papers | Meta

Meta FAIR has released NeuralSet, a Python package that bridges neuroscience and artificial intelligence by supporting...

MarkTechPost
smol-audio: A Colab-Friendly Notebook Collection for Fine-Tuning Whisper, Parakeet, Voxtral, Granite Speech, and Audio Flamingo 3
Research & Papers

smol-audio is a collection of Google Colab-friendly notebooks designed for fine-tuning multiple audio AI models...

MarkTechPost
Enabling privacy-preserving AI training on everyday devices
Research & Papers

A new method enables privacy-preserving AI training on everyday devices, making accurate and efficient AI models...

MIT News AI
The evolution of encoders: From simple models to multimodal AI
Research & Papers

The article explores encoders as a fundamental component of AI systems that convert real-world information into...

AI News
Researchers find AI text is making the internet more uniform and weirdly cheerful
Research & Papers

A large-scale analysis of Internet Archive websites reveals that AI-generated text is already widespread on the web,...

The Decoder
Top 10 Physical AI Models Powering Real-World Robots in 2026
Research & Papers

A new class of foundation models purpose-built for physical action rather than text generation is now being deployed on...

MarkTechPost
How to Build a Lightweight Vision-Language-Action-Inspired Embodied Agent with Latent World Modeling and Model Predictive Control
Research & Papers

This tutorial demonstrates how to build a lightweight embodied AI agent that learns to perceive, plan, and act directly...

MarkTechPost
Meet Talkie-1930: A 13B Open-Weight LLM Trained on Pre-1931 English Text for Historical Reasoning and Generalization Research
Research & Papers

Researchers have developed Talkie-1930, a 13 billion parameter open-weight language model trained exclusively on...

MarkTechPost
Meta AI Releases Sapiens2: A High-Resolution Human-Centric Vision Model for Pose, Segmentation, Normals, Pointmap, and Albedo
Research & Papers | Meta

Meta AI has released Sapiens2, a foundation model for human-centric vision tasks that achieves state-of-the-art...

MarkTechPost
Announcing our partnership with the Republic of Korea
Research & Papers | Google

Google DeepMind has announced a partnership with the Republic of Korea to leverage frontier AI models for accelerating...

DeepMind Blog
A faster way to estimate AI power consumption
Research & Papers

EnergAIzer is a new method that estimates AI power consumption in seconds, providing reliable results for data...

MIT News AI
How to Build Smarter Multilingual Text Wrapping with BudouX Through Parsing, HTML Rendering, Model Introspection, and Toy Training
Research & Papers

This tutorial explores BudouX, a library for intelligent phrase-aware line breaking in languages without natural...

MarkTechPost
RAG Without Vectors: How PageIndex Retrieves by Reasoning
Research & Papers

The article discusses PageIndex, a RAG (Retrieval-Augmented Generation) system that retrieves information through...

MarkTechPost
Google DeepMind Introduces Vision Banana: An Instruction-Tuned Image Generator That Beats SAM 3 on Segmentation and Depth Anything V3 on Metric Depth Estimation
Research & Papers | Google

Google DeepMind introduces Vision Banana, an instruction-tuned image generator that outperforms SAM 3 on segmentation...

MarkTechPost
MIT scientists build the world’s largest collection of Olympiad-level math problems, and open it to everyone
Research & Papers

MIT scientists have created the world's largest collection of over 30,000 Olympiad-level math problems from 47...

MIT News AI
Google DeepMind Introduces Decoupled DiLoCo: An Asynchronous Training Architecture Achieving 88% Goodput Under High Hardware Failure Rates
Research & Papers | Google

Google DeepMind has introduced Decoupled DiLoCo, an asynchronous training architecture designed to address hardware...

MarkTechPost
A Coding Tutorial on OpenMythos on Recurrent-Depth Transformers with Depth Extrapolation, Adaptive Computation, and Mixture-of-Experts Routing
Research & Papers | Anthropic

This tutorial explores OpenMythos, a theoretical reconstruction enabling deeper reasoning in transformer models through...

MarkTechPost
AI galaxy hunters are adding to the global GPU crunch
Research & Papers

Astronomers are using GPUs to identify distant galaxies in astronomical data, contributing to the growing global...

TechCrunch AI