Papers

	This page needs additional information.
	Key elements of this article are missing. You can help AI Wiki by expanding it.

Important Papers

Other Papers

Name	Date	Source	Type	Organization	Product	Note
Language Is Not All You Need: Aligning Perception with Language Models (Kosmos-1	2023/02/27	arxiv:2302.14045	Natural Language Processing	Microsoft	Kosmos-1
LLaMA: Open and Efficient Foundation Language Models	2023/02/25	paper blog post github	Natural Language Processing	Meta	LLaMA
Structure and Content-Guided Video Synthesis with Diffusion Models (Gen-1)	2023/02/06	arxiv:2302.03011 blog post	Vidoe-to-Video	Runway	Gen-1
Dreamix: Video Diffusion Models are General Video Editors	2023/02/03	arxiv:2302.01329 blog post		Google	Dreamix
FLAME: A small language model for spreadsheet formulas	2023/01/31	arxiv:2301.13779		Microsoft	FLAME
SingSong: Generating musical accompaniments from singing	2023/01/30	arxiv:2301.12662 blog post	Audio		SingSong
MusicLM: Generating Music From Text	2023/01/26	arxiv:2301.11325 blog post	Audio	Google	MusicLM
Mastering Diverse Domains through World Models (DreamerV3)	2023/01/10	arxiv:2301.04104v1 blogpost		DeepMind	DreamerV3
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers (VALL-E)	2023/01/05	arxiv:2301.02111 Demo		Micorsoft	VALL-E
Muse: Text-To-Image Generation via Masked Generative Transformers	2023/01/02	arxiv:2301.00704 blog post	Computer Vision	Google	Muse
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model	2022/11/09	arxiv:2211.05100 Blog Post	Natural Language Processing	Hugging Face	BLOOM	Open source LLM that is a competitor to GPT-3
AudioLM: a Language Modeling Approach to Audio Generation	2022/09/07	arxiv:2209.03143 web page blog post	Audio	Google	AudioML
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding (Imagen)	2022/05/23	arxiv:2205.11487 Blog Post	Computer Vision	Google	Imagen
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback	2022/04/12	arxiv:2204.05862 GitHub	Natural Language Processing	Anthropic	RLHF (Reinforcement Learning from Human Feedback)
PaLM: Scaling Language Modeling with Pathways	2022/04/05	arxiv:2204.02311 Blog Post	Natural Language Processing	Google	PaLM (Pathways Language Model)
Constitutional AI: Harmlessness from AI Feedback	2021/12/12	arxiv:2212.08073	Natural Language Processing	Anthropic	Constitutional AI, Claude
Improving language models by retrieving from trillions of tokens (RETRO)	2021/12/08	arxiv:2112.04426 Blog post	Natural Language Processing	OpenAI	RETRO (Retrieval Enhanced Transformer)
InstructPix2Pix: Learning to Follow Image Editing Instructions	2021/11/17	arxiv:2211.09800 Blog Post	Computer Vision	UC Berkley	InstructPix2Pix
REALM: Retrieval-Augmented Language Model Pre-Training	2020/02/10	arxiv:2002.08909 Blog Post	Natural Language Processing	Google	REALM (Retrieval-Augmented Language Model Pre-Training)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (T5)	2019/10/23	arxiv:1910.10683	Natural Language Processing blog post	Google	T5 (Text-To-Text Transfer Transformer)
RoBERTa: A Robustly Optimized BERT Pretraining Approach	2019/07/26	arxiv:1907.11692	Natural Language Processing blog post	Meta	RoBERTa (Robustly Optimized BERT Pretraining Approach)
Probabilistic Face Embeddings	2019/04/21	arxiv:1904.09658	Computer Vision		PFEs (Probabilistic Face Embeddings)
Language Models are Unsupervised Multitask Learners (GPT-2)	2018	paper	Natural Language Processing	OpenAI	GPT-2