Language Models
23 articles
BERT
Encoder-Only, Pretrained Language Models
Bidirectional language model
Natural Language Processing
BioBERT
Biomedical NLP, Healthcare AI, Transformers
BioGPT
Biomedical NLP, Healthcare AI, Microsoft Research
CamemBERT
Multilingual models, Natural Language Processing
DeepSeek-R1
DeepSeek, Reasoning Models
GPT-J
EleutherAI, Open-Source Models
Gemini Nano
Google, Mobile AI, On-Device AI
Gopher (language model)
AI Models, DeepMind
InstructGPT
Alignment, OpenAI, RLHF
LayoutLM
Document AI, Multimodal models
Llama 4
Meta AI
Longformer
Natural Language Processing, Transformer architecture
Mistral 7B
Mistral AI, Open-Source Models
OLMo
AI Models, Open Source AI
OPUS-MT
Machine translation, Natural Language Processing
SantaCoder
Code generation, Large Language Models
SciBERT
Domain-specific models, Natural Language Processing
Switch Transformer
Google, Mixture of Experts, Transformers
T5 (language model)
Encoder-Decoder Architectures
Unidirectional language model
Natural Language Processing
WordPiece
Natural Language Processing, Tokenization
XLM-RoBERTa
Multilingual models, Natural Language Processing