Recent changes
The 100 most recently updated articles. New pages start at v1; higher version numbers mean an existing article was revised.
Monday, June 29, 2026
- TARS Roboticsv6--- Full name February 5, 2025 Founders Beijing and Shanghai, China Industry TARS A Series (industrial), T Series (general-purpose), AWE foundation...
- Programming Custom GPTsv4See also: Custom GPTs, GPT Store, ChatGPT, and OpenAI Programming Custom GPTs are no-code, specialized versions of ChatGPT that users configure to write,...
- Node (decision tree)v3See also: Machine learning terms A node is the basic building block of a decision tree: it is a single point in the tree that is either a condition (an...
- chrFv2chrF is a machine translation evaluation metric that scores a candidate translation by counting the character n-grams it shares with one or more reference...
- Attribute samplingv3See also: Machine learning terms Attribute sampling is a randomization technique in which a decision tree considers only a small, randomly drawn subset of the...
- Saverv3See also: Machine learning terms In machine learning, a Saver is a utility or class that persists and restores the state of a model, its variables, and its...
- QuIP / QuIP#v2QuIP (Quantization with Incoherence Processing) is a family of weight-only post-training quantization methods for large language models developed in the...
- FlashMLAv9FlashMLA is an open-source GPU kernel from DeepSeek that accelerates the decoding step of Multi-head Latent Attention (MLA), the attention variant DeepSeek...
- Evol-Instructv2Evol-Instruct is a method for automatically generating large instruction tuning datasets by prompting a large language model to rewrite, or "evolve," existing...
- CIDErv2CIDEr (Consensus-based Image Description Evaluation) is an automatic evaluation metric for image captioning that scores a machine-generated caption by how...
- Splitv3See also: Machine learning terms A split in machine learning is a partitioning operation, and the word names two distinct things. As a data operation, a split...
- Rootv3See also: Machine learning terms In machine learning, the root is the starting node of a decision tree: the single topmost node that holds the entire training...
- Queuev3See also: Machine learning terms A queue in machine learning is a First-In-First-Out (FIFO) data structure that stages and buffers data between an input/output...
- Inference pathv3See also: Machine learning terms The inference path is the sequence of nodes that a single example visits as it travels from the root node of a decision tree...
- Clippingv5Clipping is a family of techniques in machine learning that constrain numerical values to lie within a specified range or below a specified magnitude. The most...
- Test lossv3See also: Machine learning terms Test loss is the value of a loss function computed on a held-out test data set: data that was used neither for training nor...
- Root directoryv3See also: Machine learning terms In machine learning, a root directory is the top level folder, on a local disk or in object storage, under which a training...
- Rank (Tensor)v3See also: Machine learning terms In machine learning and deep learning frameworks, the rank of a tensor is the number of dimensions (axes) it has: the count of...
- Menlo Venturesv2Menlo Ventures is an American venture capital firm founded in 1976 and headquartered in Menlo Park, California, with an office in San Francisco, making it one...
- Jakob Uszkoreitv3Jakob Uszkoreit is a German computer scientist, machine learning researcher, and entrepreneur, best known as one of the eight co-authors of the 2017 paper...
- SpaceX-xAI mergerv2The SpaceX-xAI merger was an all-stock acquisition, announced on February 2, 2026, in which the rocket and satellite company SpaceX acquired the artificial...
- Nick Frosstv2Nick Frosst is a Canadian artificial intelligence researcher, entrepreneur, and musician who co-founded the enterprise AI company Cohere in 2019 with Aidan...
- FACTS Groundingv2FACTS Grounding is a factuality benchmark from Google DeepMind and Google Research that measures whether a large language model answers a request using only...
- Dexmatev5Dexmate Inc. is an American robotics company based in Santa Clara, California, that builds AI-powered dexterous manipulation robots, most notably Vega, a...
- Causal scrubbingv2Causal scrubbing is a methodology in mechanistic interpretability for rigorously and quantitatively testing hypotheses about the internal computational...
Sunday, June 28, 2026
- Yang Zhilinv2Yang Zhilin (Chinese: 杨植麟; pinyin: Yáng Zhílín) is the co-founder and chief executive officer of Moonshot AI (月之暗面), the Beijing startup that develops the Kimi...
- Le Chat Enterprisev5Le Chat Enterprise is the enterprise edition of Le Chat, the AI assistant from Paris-based Mistral AI, launched on May 7, 2025 as a company-wide assistant that...
- Gated SAEv2A Gated sparse autoencoder (Gated SAE) is a sparse-autoencoder architecture for mechanistic interpretability that splits the encoder into a gating path, which...
- Fine-tune ChatGPT with Perplexity, Burstiness, Professionalism, Randomness and Sentimentality Guidev3See also: Guides, ChatGPT Guides and Prompt Engineering Guides The perplexity, burstiness, professionalism, randomness, and sentimentality guide is a prompt...
- DeepSeek Sparse Attention (DSA)v2DeepSeek Sparse Attention (DSA) is a trainable, fine-grained sparse attention mechanism introduced by the Chinese AI company DeepSeek in its experimental model...
- Zhang Peng (Zhipu AI)v3Zhang Peng (Chinese: 张鹏) is a Chinese computer scientist and technology executive who serves as co-founder and chief executive officer of Zhipu AI, the...
- Recursion Pharmaceuticalsv2Recursion Pharmaceuticals (NASDAQ: RXRX) is a clinical-stage TechBio company in Salt Lake City, Utah, that industrializes drug discovery by pairing artificial...
- Jacob Devlinv3Jacob Devlin is an American research scientist in natural language processing and machine learning, best known as the first author of BERT, the bidirectional...
- BERTScorev2BERTScore is an automatic, reference-based metric for evaluating text generation that scores a candidate sentence against one or more references by comparing...
- Attributev3See also: Machine learning terms In machine learning and data mining, an attribute is an individual measurable property of an object, observation, or example...
- TOPSTAR Groupv3Guangdong Topstar Technology Co., Ltd. Chinese name June 1, 2007 Founder Dalingshan Town, Dongguan City, Guangdong Province, China Industry Industrial...
- Lightning Attentionv2Lightning Attention is an IO-aware (input/output aware) implementation of linear attention that lets the method reach its theoretical linear-time complexity in...
- John Jumperv3John Jumper (born 1 January 1985) is an American computational chemist and biophysicist who led the development of AlphaFold, the artificial intelligence...
- Devicev3See also: Machine learning terms In machine learning, a device is the hardware target on which tensor operations are executed: a CPU, an NVIDIA GPU through...
- Aldebaran Roboticsv2Aldebaran Robotics, later known simply as Aldebaran, was a French robotics company founded in 2005 in Paris by entrepreneur Bruno Maisonnier, best known for...
- WebVoyagerv2WebVoyager is an end-to-end web agent and its companion benchmark, introduced by Hongliang He and seven coauthors in a paper accepted to ACL 2024 [1][2]. It is...
- JumpReLU SAEv3A JumpReLU sparse autoencoder (JumpReLU SAE) is a variant of the sparse autoencoder used in mechanistic interpretability whose encoder applies a learnable...
- Jonathan Hov2Jonathan Ho is a machine learning researcher best known as the lead author of "Denoising Diffusion Probabilistic Models" (DDPM), the 2020 paper that made...
- Foundationv5Foundation (also known as Foundation Future Industries, and formerly Foundation Robotics Labs or Foundation Future) is an American robotics company based in...
- Cyan Roboticsv5--- Full name Shanghai Qingxin Yichuang Technology Company Founded Niu Tengdi Headquarters Robotics, Humanoid robots Products TEAMS Design...
- Non-binary conditionv3See also: Machine learning terms In decision tree learning, a non-binary condition is a test at a node that has more than two possible outcomes, routing each...
- Native Sparse Attention (NSA)v2Native Sparse Attention (NSA) is a hardware-aligned, natively trainable sparse attention mechanism introduced in February 2025 by DeepSeek, in collaboration...
- Mirsee Roboticsv5--- Full name 2017 Founders 485 Pinebush Road, Suite 203, Cambridge, Ontario, Canada Industry MH1, MH2, MH3 humanoid robots Key technology ...
- Miles Brundagev2Miles Brundage is an American AI policy and governance researcher best known for leading policy research at OpenAI and serving as the company's senior advisor...
- Estun Automationv3Nanjing Estun Automation Co., Ltd. Chinese name March 1993 Founder Nanjing, Jiangsu, China Industry Wu Bo (Chairman), Wu Kan (President) Products ...
- Task arithmeticv2Task arithmetic is a model-editing technique that steers the behavior of a neural network by adding or subtracting vectors in its weight space. The central...
- Productivity Custom GPTsv4See also: Custom GPTs, GPT Store and ChatGPT "Increase your efficiency" , Official Description Productivity Custom GPTs are the category of user-built Custom...
- Łukasz Kaiserv3Łukasz Kaiser is a Polish computer scientist and researcher at OpenAI who is one of the eight co-authors of the 2017 paper "Attention Is All You Need," the...
- ALOHA 2v3ALOHA 2 is an open-source, low-cost bimanual teleoperation hardware platform released in February 2024 by an Google DeepMind led team working with the original...
- Agentic Commerce Protocolv2The Agentic Commerce Protocol (ACP) is an open technical standard, co-developed by OpenAI and Stripe and announced on September 29, 2025, that lets AI agents...
- Soumith Chintalav3Soumith Chintala is an Indian-American artificial intelligence engineer and researcher best known as the co-creator and long-time lead of PyTorch, the...
- NVFP4v3NVFP4 (NVIDIA FP4) is a 4-bit floating-point number format introduced by Nvidia with the Blackwell GPU architecture. It stores each value in just 4 bits using...
- Attention sinkv2An attention sink is an empirical phenomenon in Transformer language models in which a large fraction of each attention head's weight concentrates on a few...
- Xaira Therapeuticsv2Xaira Therapeutics is an American AI-driven drug discovery company based in South San Francisco, California, that launched publicly on April 23, 2024 with more...
- Voxtralv2Voxtral is a family of open-weight speech-understanding models released by Mistral AI on July 15, 2025, under the Apache 2.0 license. It ships in two sizes,...
- SimpleQA Verifiedv2SimpleQA Verified is a short-form factuality benchmark released by Google DeepMind and Google Research in September 2025 that measures the parametric knowledge...
- Chunked prefillv2Chunked prefill is a scheduling technique for large language model serving that splits the processing of a long input prompt (the prefill) into smaller,...
- Aaron Courvillev3Aaron Courville is a Canadian computer scientist, a full professor in the Department of Computer Science and Operations Research (DIRO) at the Universite de...
- xAI Colossusv3Colossus is an artificial-intelligence supercomputer and GPU training cluster operated by xAI in Memphis, Tennessee, and is widely described as one of the...
- Stable Audio 2.5v3Stable Audio 2.5 is an enterprise focused text-to-audio generation model released by Stability AI on September 10, 2025. It generates production ready music...
- Jonathan Hurstv2Jonathan W. Hurst is an American roboticist who is the co-founder and Chief Robot Officer of Agility Robotics, the company behind the Digit warehouse humanoid...
- Insilico Medicinev2Insilico Medicine is a clinical-stage biotechnology company that uses generative AI to discover disease targets and design small-molecule drugs end to end....
- Anton Korinekv3Anton Korinek is an Austrian-American economist and a professor in the Department of Economics and the Darden School of Business at the University of Virginia...
- PlayHTv2PlayHT, later rebranded PlayAI (and reachable at play.ht and play.ai), was an American generative AI voice company that built text-to-speech models, voice...
- NVIDIA Vera (CPU)v2NVIDIA Vera is a custom Arm-based data center central processing unit from NVIDIA, unveiled at the GTC Taipei keynote at COMPUTEX 2026 on June 1, 2026 and...
- NVIDIA DGX Cloudv2NVIDIA DGX Cloud is a managed AI-supercomputing-as-a-service offering from Nvidia that rents enterprises access to multi-node clusters of NVIDIA DGX...
- Genesis Missionv2The Genesis Mission is a United States federal science initiative, launched by an executive order that President Donald Trump signed on November 24, 2025, that...
- Coatue Managementv2Coatue Management is an American technology-focused investment firm founded in 1999 by Philippe Laffont, a former Tiger Management analyst and one of the...
- TIES-Mergingv2TIES-Merging is a training-free model merging method that combines several models fine-tuned from a shared pre-trained checkpoint into one multitask model...
- NVentures (Nvidia)v2NVentures is the corporate venture-capital arm of Nvidia, the dominant supplier of graphics processors used to train and run artificial-intelligence models....
- NVIDIA A800v2The NVIDIA A800 is a datacenter graphics processing unit that Nvidia created for the Chinese market in late 2022 as an export-compliant variant of the A100,...
- MuSRv3MuSR (Multistep Soft Reasoning) is a benchmark for evaluating multistep reasoning in large language models, built around long free-text narratives such as...
- Best-of-N samplingv2Best-of-N sampling (BoN) is an inference-time method that improves a large language model output by drawing N independent candidate responses to the same...
- Raine v. OpenAIv3Raine v. OpenAI, Inc. is a wrongful-death lawsuit filed on August 26, 2025, in the Superior Court of California for the County of San Francisco (case number...
- Illia Polosukhinv4Illia Polosukhin is a Ukrainian computer scientist and entrepreneur best known as one of the eight co-authors of the 2017 paper "Attention Is All You Need",...
- GLM-130Bv2GLM-130B is a 130-billion-parameter bilingual (English and Chinese) large language model released in August 2022 by the Knowledge Engineering Group (KEG) and...
- George Hotzv2George Hotz (born October 2, 1989), known online as "geohot," is an American software engineer, security researcher, and entrepreneur best known for being the...
- Diligent Roboticsv2Diligent Robotics is an American healthcare robotics company based in Austin, Texas, that builds Moxi, an autonomous mobile manipulation robot used by...
- LongBench v2v2LongBench v2 is a benchmark for evaluating how well large language models understand and reason over long contexts. It consists of 503 challenging...
- Index Venturesv2Index Ventures is an international venture capital firm founded in 1996 in Geneva, Switzerland, that backs technology startups from seed through late-stage...
- Emergent misalignmentv2Emergent misalignment is an AI safety finding, first reported in February 2025, in which fine-tuning a large language model on a single narrow bad behavior...
- Chain of Thought Monitorabilityv2Chain of thought monitorability is the property that lets safety researchers read a reasoning model's chain-of-thought (CoT), the step-by-step working it...
- Altimeter Capitalv2Altimeter Capital is a technology-focused investment firm, founded in 2008 by Brad Gerstner, that runs both a public-markets hedge fund and a series of private...
- Wojciech Zarembav2Wojciech Zaremba is a Polish computer scientist and a co-founder of OpenAI, the artificial intelligence research company started in December 2015, where he has...
- Massively Multilingual Speech (MMS)v2Massively Multilingual Speech (MMS) is an open-source speech project released by Meta AI in May 2023 that performs speech recognition and text-to-speech...
- Koray Kavukcuogluv2Koray Kavukcuoglu is a Turkish computer scientist who serves as Chief Technology Officer of Google DeepMind and, since June 2025, as Chief AI Architect of...
- Kling 3.0v2Kling 3.0 is the third-generation AI video generation model family released by Kuaishou on February 4, 2026, comprising four models (Video 3.0, Video 3.0 Omni,...
- Diederik Kingmav3Diederik Kingma is a Dutch machine learning researcher and a founding member of OpenAI who is best known as the first author of the Adam optimizer [1] and the...
- WeatherNext 2v2WeatherNext 2 is an artificial intelligence weather forecasting model from Google DeepMind and Google Research, announced on November 17, 2025, that generates...
- OPT (Open Pre-trained Transformer)v2OPT (Open Pre-trained Transformer) is a suite of decoder-only large language models released by Meta AI in May 2022, ranging from 125 million to 175 billion...
- Kimi K2.6v2Kimi K2.6 is an open-weight, trillion-parameter mixture of experts (MoE) large language model released by Moonshot AI on 20 April 2026 for agentic coding and...
- Gladiav3Gladia is a French artificial-intelligence company that builds audio infrastructure for developers and voice-product teams, centered on a speech-to-text...
- Brett Adcockv2Brett Adcock (born April 6, 1986) is an American technology entrepreneur who is the founder and chief executive officer of Figure AI, the Silicon Valley...
- NVIDIA Gracev2NVIDIA Grace is an Arm-based data-center central processing unit (CPU) from Nvidia, built from 72 Arm Neoverse V2 cores with co-packaged LPDDR5X memory and a...
- Kate Crawfordv2Kate Crawford (born 1974) is an Australian scholar, author, and artist known as one of the leading critical voices on the politics of artificial intelligence....