Research & Papers

Latest AI research

OpenAI shifts the boundary of automated reasoning with a "milestone in AI mathematics" that experts are now unpacking
Research & Papers | OpenAI

OpenAI's reasoning model disproved a 77-year-old mathematical conjecture by Paul Erdős using unexpected algebraic...

The Decoder
Meet Turbovec: A Rust Vector Index with Python Bindings, and Built on Google’s TurboQuant Algorithm
Research & Papers | Google

Turbovec is a new Rust-based vector index with Python bindings that implements Google Research's TurboQuant algorithm...

MarkTechPost
How to Build Knowledge Graph Generation Pipelines From Text With kg-gen, NetworkX Analytics, and Interactive Visualizations
Research & Papers

This tutorial demonstrates how to build knowledge graph generation pipelines from text using kg-gen, LiteLLM, and...

MarkTechPost
Research & Papers | OpenAI

An OpenAI model has disproved a central conjecture in discrete geometry

An OpenAI model solved a longstanding 80-year-old unit distance problem in discrete geometry, disproving a major...

OpenAI Blog
Demis Hassabis said this might be the ‘foothills of the singularity.’ What?
Research & Papers | Google

Google DeepMind CEO Demis Hassabis declared at Google I/O that the company's AI research represents a "profound moment...

The Verge AI
Justin Solomon appointed associate dean of engineering education
Research & Papers

MIT faculty member in electrical engineering and computer science to focus on innovation in engineering education and...

MIT News AI
Google’s Genie world model can now simulate real streets with Street View
Research & Papers | Google

Google DeepMind has integrated Street View data with Project Genie, its world model, to create interactive simulations...

TechCrunch AI
Stochastic Gradient Descent (SGD’s) Frequency Bias and How Adam Fixes It 
Research & Papers

The article examines how stochastic gradient descent (SGD) exhibits frequency bias when training language models on...

MarkTechPost
Import AI 457: AI stuxnet; cursed Muon optimizer; and positive alignment
Research & Papers

Import AI newsletter covering multiple AI research topics including AI security vulnerabilities (Stuxnet analogy),...

Import AI
NVIDIA Introduces a 4-Bit Pretraining Methodology Using NVFP4, Validated on a 12B Hybrid Mamba-Transformer at 10T Token Horizon
Research & Papers | Google

NVIDIA introduces NVFP4, a 4-bit pretraining methodology that combines selective BF16 layers, random Hadamard...

MarkTechPost
Introducing Google Antigravity 2.0
Research & Papers

DeepMind Blog
A Coding Implementation to Compress and Benchmark Instruction-Tuned LLMs with FP8, GPTQ, and SmoothQuant Quantization using llmcompressor
Research & Papers

In this tutorial, we explore how to apply post-training quantization to an instruction-tuned language model using...

MarkTechPost
Gemini for Science: AI experiments and tools for a new era of discovery
Research & Papers | Google

Google announces Gemini for Science, a collection of AI tools and experiments designed to enhance scientific research...

DeepMind Blog
Making it easier to understand how content was created and edited
Research & Papers

We're expanding our tools to help you understand how content was created and edited across the web.

DeepMind Blog
World Action Models give robots the ability to simulate consequences before they move
Research & Papers

World Action Models enable robots to simulate and understand how their actions change the world, addressing a key...

The Decoder
A Coding Guide Implementing SHAP Explainability Workflows with Explainer Comparisons, Maskers, Interactions, Drift, and Black-Box Models
Research & Papers

This tutorial provides a practical guide to implementing SHAP explainability workflows for interpreting machine...

MarkTechPost
Nous Research Proposes Lighthouse Attention: A Training-Only Selection-Based Hierarchical Attention That Delivers 1.4–1.7× Pretraining Speedup at Long Context
Research & Papers

Nous Research introduces Lighthouse Attention, a hierarchical attention mechanism that reduces computational complexity...

MarkTechPost
New benchmark confirms AI video generators look stunning but still can't reason about the world
Research & Papers

WorldReasonBench, a new benchmark, evaluates AI video generators on physical and logical plausibility rather than image...

The Decoder
Researchers train AI model that hits near-full performance with just 12.5 percent of its experts
Research & Papers

Researchers at Allen Institute for AI and UC Berkeley developed EMO, a mixture-of-experts model that organizes experts...

The Decoder