Research & Papers

Latest AI research

Study: Firms often use automation to control certain workers’ wages
Research & Papers

MIT economists found US companies tend to target employees earning a “wage premium,” which increases inequality but not...

MIT News AI
OpenAI built a networking protocol with AMD, Broadcom, Intel, Microsoft, and NVIDIA to fix AI supercomputer bottlenecks
Research & Papers | OpenAI

OpenAI partnered with AMD, Broadcom, Intel, Microsoft, and NVIDIA to develop MRC, an open source networking protocol...

The Decoder
Using AI for Just 10 Minutes Might Make You Lazy and Dumb, Study Shows
Research & Papers

A new study reveals that using AI assistants for even brief periods can negatively impact cognitive abilities and...

Wired AI
Closing the ‘Expressivity Gap’: How Mistral’s Voxtral TTS is Redefining Multilingual Voice Cloning with a Hybrid Autoregressive and Flow-Matching Architecture
Research & Papers | Mistral

Mistral introduces Voxtral, a text-to-speech system using hybrid autoregressive and flow-matching architecture to...

MarkTechPost
Games people — and machines — play: Untangling strategic reasoning to advance AI
Research & Papers

Assistant Professor Gabriele Farina conducts research on strategic decision-making in multi-agent scenarios, exploring...

MIT News AI
Unlocking large scale AI training networks with MRC (Multipath Reliable Connection)
Research & Papers | OpenAI

OpenAI introduces MRC (Multipath Reliable Connection), a new networking protocol designed to improve resilience and...

OpenAI Blog
Why Gradient Descent Zigzags and How Momentum Fixes It
Research & Papers

The article explains how momentum optimization improves gradient descent by reducing oscillations and speeding up...

MarkTechPost
A Coding Guide to Survey Bias Correction Using Facebook Research Balance with IPW CBPS Ranking and Post Stratification Methods
Research & Papers | Meta

A tutorial on correcting survey bias using Facebook Research's balance library, demonstrating multiple re-weighting...

MarkTechPost
Zyphra Introduces Tensor and Sequence Parallelism (TSP): A Hardware-Aware Training and Inference Strategy That Delivers 2.6x Throughput Over Matched TP+SP Baselines
Research & Papers

Zyphra introduces Tensor and Sequence Parallelism (TSP), a hardware-aware training and inference strategy that combines...

MarkTechPost
How to Build an End-to-End Production Grade Machine Learning Pipeline with ZenML, Including Custom Materializers, Metadata Tracking, and Hyperparameter Optimization
Research & Papers

This tutorial demonstrates how to build a production-grade machine learning pipeline using ZenML, covering custom...

MarkTechPost
A Coding Implementation to Explore and Analyze the TaskTrove Dataset with Streaming Parsing Visualization and Verifier Detection
Research & Papers

This tutorial demonstrates how to efficiently explore and analyze the TaskTrove dataset from Hugging Face using...

MarkTechPost
MIT study explains why scaling language models works so reliably
Research & Papers

MIT researchers have discovered that a phenomenon called superposition explains why large language model performance...

The Decoder
Sakana AI Introduces KAME: A Tandem Speech-to-Speech Architecture That Injects LLM Knowledge in Real Time
Research & Papers

Sakana AI has introduced KAME, a tandem speech-to-speech architecture that integrates LLM knowledge into conversational...

MarkTechPost
A New NVIDIA Research Shows Speculative Decoding in NeMo RL Achieves 1.8× Rollout Generation Speedup at 8B and Projects 2.5× End-to-End Speedup at 235B
Research & Papers | Google

NVIDIA Research has published a paper demonstrating speculative decoding integrated into NeMo RL with vLLM backend,...

MarkTechPost
A Coding Implementation of End-to-End Brain Decoding from MEG Signals Using NeuralSet and Deep Learning for Predicting Linguistic Features
Research & Papers

This tutorial demonstrates an end-to-end neuroAI pipeline that decodes linguistic features from MEG brain signals using...

MarkTechPost
Improving understanding with language
Research & Papers

MIT senior Olivia Honeycutt investigates how the ways we communicate can shape our views of the world.

MIT News AI
Microsoft Research’s World-R1 Uses Flow-GRPO and 3D-Aware Rewards to Inject Geometric Consistency Into Wan 2.1 Without Architectural Changes
Research & Papers | Microsoft

Microsoft Research introduces World-R1, a technique that uses Flow-GRPO and 3D-aware rewards to improve geometric...

MarkTechPost
Making the case for curiosity-driven science
Research & Papers

President Sally Kornbluth spoke in front of a packed crowd about growing challenges to the U.S. research ecosystem as...

MIT News AI
Top 10 KV Cache Compression Techniques for LLM Inference: Reducing Memory Overhead Across Eviction, Quantization, and Low-Rank Methods
Research & Papers

The article discusses 10 KV cache compression techniques for optimizing Large Language Model inference, covering...

MarkTechPost