Research & Papers · Other Companies

Sakana AI and NVIDIA Introduce TwELL with CUDA Kernels for 20.5% Inference and 21.9% Training Speedup in LLMs

Sakana AI and NVIDIA researchers demonstrated that L1 regularization can achieve over 99% sparsity in LLM feedforward...

MarkTechPost
Best Vector Databases in 2026: Pricing, Scale Limits, and Architecture Tradeoffs Across Nine Leading Systems

A comprehensive guide comparing nine leading vector database systems used as core infrastructure for RAG and agentic AI...

MarkTechPost
How to Build a Single-Cell RNA-seq Analysis Pipeline with Scanpy for PBMC Clustering, Annotation, and Trajectory Discovery

This tutorial demonstrates an advanced single-cell RNA-seq analysis workflow using Scanpy on the PBMC-3k dataset,...

MarkTechPost
Five architects of the AI economy explain where the wheels are coming off

Five key figures across the AI supply chain gathered at the Milken Global Conference to discuss critical challenges in...

TechCrunch AI
Study: Firms often use automation to control certain workers’ wages

MIT economists found US companies tend to target employees earning a “wage premium,” which increases inequality but not...

MIT News AI
Using AI for Just 10 Minutes Might Make You Lazy and Dumb, Study Shows

A new study reveals that using AI assistants for even brief periods can negatively impact cognitive abilities and...

Wired AI
Games people — and machines — play: Untangling strategic reasoning to advance AI

Assistant Professor Gabriele Farina conducts research on strategic decision-making in multi-agent scenarios, exploring...

MIT News AI
Why Gradient Descent Zigzags and How Momentum Fixes It

The article explains how momentum optimization improves gradient descent by reducing oscillations and speeding up...

MarkTechPost
Zyphra Introduces Tensor and Sequence Parallelism (TSP): A Hardware-Aware Training and Inference Strategy That Delivers 2.6x Throughput Over Matched TP+SP Baselines

Zyphra introduces Tensor and Sequence Parallelism (TSP), a hardware-aware training and inference strategy that combines...

MarkTechPost
How to Build an End-to-End Production Grade Machine Learning Pipeline with ZenML, Including Custom Materializers, Metadata Tracking, and Hyperparameter Optimization

This tutorial demonstrates how to build a production-grade machine learning pipeline using ZenML, covering custom...

MarkTechPost
A Coding Implementation to Explore and Analyze the TaskTrove Dataset with Streaming Parsing Visualization and Verifier Detection

This tutorial demonstrates how to efficiently explore and analyze the TaskTrove dataset from Hugging Face using...

MarkTechPost
MIT study explains why scaling language models works so reliably

MIT researchers have discovered that a phenomenon called superposition explains why large language model performance...

The Decoder
Sakana AI Introduces KAME: A Tandem Speech-to-Speech Architecture That Injects LLM Knowledge in Real Time

Sakana AI has introduced KAME, a tandem speech-to-speech architecture that integrates LLM knowledge into conversational...

MarkTechPost
A Coding Implementation of End-to-End Brain Decoding from MEG Signals Using NeuralSet and Deep Learning for Predicting Linguistic Features

This tutorial demonstrates an end-to-end neuroAI pipeline that decodes linguistic features from MEG brain signals using...

MarkTechPost
Improving understanding with language

MIT senior Olivia Honeycutt investigates how the ways we communicate can shape our views of the world.

MIT News AI
Making the case for curiosity-driven science

President Sally Kornbluth spoke in front of a packed crowd about growing challenges to the U.S. research ecosystem as...

MIT News AI
Top 10 KV Cache Compression Techniques for LLM Inference: Reducing Memory Overhead Across Eviction, Quantization, and Low-Rank Methods

The article discusses 10 KV cache compression techniques for optimizing Large Language Model inference, covering...

MarkTechPost
The MIT-IBM Computing Research Lab launches to shape the future of AI and quantum computing

MIT and IBM have launched a new computing research lab focused on the convergence of AI, algorithms, and quantum...

MIT News AI
smol-audio: A Colab-Friendly Notebook Collection for Fine-Tuning Whisper, Parakeet, Voxtral, Granite Speech, and Audio Flamingo 3

smol-audio is a collection of Google Colab-friendly notebooks designed for fine-tuning multiple audio AI models...

MarkTechPost