# Amazon Nova

> Source: https://aiwiki.ai/wiki/amazon_nova
> Updated: 2026-06-24
> Categories: AI Infrastructure, Artificial Intelligence, Generative AI, Large Language Models
> From AI Wiki (https://aiwiki.ai), a free encyclopedia of artificial intelligence. Quote with attribution.

**Amazon Nova** is a family of [foundation models](/wiki/foundation_models) developed by [Amazon](/wiki/amazon) and offered through [Amazon Bedrock](/wiki/amazon_bedrock), announced on December 3, 2024, at the AWS re:Invent conference in Las Vegas.[1][2] Amazon positions Nova on price-performance: it states that Nova Micro, Lite, and Pro are "at least 75 percent less expensive than the best performing models in their respective intelligence classes in Amazon Bedrock" and are also the fastest models in those classes, supporting tasks across 200 languages and multiple modalities.[1][2] The family includes text-only and [multimodal](/wiki/multimodal_ai) understanding models (Nova Micro, Nova Lite, Nova Pro, and Nova Premier), an image generation model (Nova Canvas), a video generation model (Nova Reel), a speech-to-speech model (Nova Sonic), and an agentic browser-action model and SDK (Nova Act). In December 2025, at re:Invent 2025, Amazon introduced the second generation of the family, Nova 2, with updated models and new capabilities including extended thinking and built-in tools.[18]

## When was Amazon Nova announced?

Amazon CEO Andy Jassy and AWS CEO Matt Garman introduced Amazon Nova at the 2024 AWS re:Invent keynote on December 3, 2024.[1][2] The announcement positioned Nova as Amazon's first-party answer to models from [OpenAI](/wiki/openai), [Google](/wiki/google_deepmind), [Anthropic](/wiki/anthropic), and [Meta](/wiki/meta_ai). Before Nova, Amazon had relied primarily on third-party models available through Bedrock (including Anthropic's [Claude](/wiki/claude), Meta's [Llama](/wiki/llama), and Mistral's models) rather than offering competitive in-house alternatives. The Titan model family, Amazon's earlier generation of proprietary models, had not gained significant traction against the competition.

Rohit Prasad, SVP of Amazon Artificial Intelligence, framed the launch around builder pain points, saying the new models "provide compelling intelligence and content generation while also delivering meaningful progress on latency, cost-effectiveness, customization, information grounding, and agentic capabilities."[21]

At launch, Nova Micro, Nova Lite, and Nova Pro were made generally available in Amazon Bedrock.[1] Nova Premier, the largest model in the family, was still in training at the time of announcement and was released later on April 30, 2025.[5][17]

The models are available exclusively through Amazon Bedrock, Amazon's managed service for accessing foundation models via API. Early enterprise adopters included SAP, which integrated Nova models into its SAP AI Core generative AI hub to power skills for Joule, SAP's AI copilot.

## Model lineup

The first-generation Amazon Nova family consists of several models, each targeting a different point on the cost-performance spectrum.

### Nova Micro

Nova Micro is a text-only model designed for the lowest latency and lowest cost among the Nova family.[1] It accepts text input and generates text output, with a context window of 128,000 tokens. Amazon optimized Micro for tasks such as text summarization, translation, content classification, interactive chat, brainstorming, and basic mathematical reasoning. Because it processes only text (no images or video), it is the fastest model in the lineup and the cheapest per token. At $0.035 per million input tokens and $0.14 per million output tokens, Micro is one of the least expensive commercial [large language model](/wiki/large_language_model) APIs on the market.[15] This makes it attractive for high-volume, latency-sensitive applications like real-time chatbots and content filtering pipelines.

### Nova Lite

Nova Lite is a low-cost multimodal model that processes text, images, video, and documents.[1] It supports a context window of 300,000 tokens, which is equivalent to roughly 30 minutes of video input. Amazon designed Lite as a budget-friendly option for multimodal workloads where speed and cost matter more than peak accuracy. Common use cases include document understanding, visual question answering, and video summarization. At $0.06 per million input tokens, Lite costs less than two cents per typical request, making it viable for processing large volumes of visual content in automated pipelines.[15]

### Nova Pro

Nova Pro is the mid-range multimodal model, positioned as offering the best combination of accuracy, speed, and cost across a broad range of tasks.[1] Like Lite, it supports text, image, and video inputs with a 300,000-token context window. Amazon reported that Nova Pro performed equal to or better than [GPT-4o](/wiki/gpt4) on 17 out of 20 benchmarks and outperformed [Gemini](/wiki/gemini) 1.5 Pro on 16 out of 21 benchmarks.[1][19] Nova Pro is particularly strong in agentic workflows that involve calling APIs and tools to complete multi-step tasks. On instruction-following evaluations, including the Comprehensive RAG Benchmark (CRAG) and the Berkeley Function Calling Leaderboard, Pro scored well against competing models in its price class.

In a summarization cost comparison published by Amazon, Nova Pro cost approximately $0.00068 per task compared to GPT-4o's $0.00122, a 44% cost reduction for equivalent work.[1] With throughput of approximately 200 tokens per second, Pro is also roughly twice as fast as GPT-4o on inference speed.

### Nova Premier

Nova Premier is the most capable model in the first-generation Nova family. It was released on April 30, 2025, after being announced as "still in training" at the December 2024 launch event.[5][17] Amazon had originally targeted Q1 2025 for Premier's release, but the model arrived at the end of April.[2] Premier supports a context window of 1,000,000 tokens, allowing it to process large codebases, long documents, and extended videos in a single prompt.[5] In addition to being the strongest Nova model for complex reasoning, Premier was designed to serve as a "teacher" model for [model distillation](/wiki/knowledge_distillation), transferring its capabilities to smaller, faster Nova models for specific use cases.[5]

On Amazon's internal benchmarks, Premier scored 87.4 on [MMLU](/wiki/mmlu) (undergraduate-level knowledge), 87.4 on [MMMU](/wiki/mmmu) (multimodal understanding), and 86.3 on [SimpleQA](/wiki/simpleqa).[4] On IFEval (instruction following), Premier scored 91.5%.[4] On the Berkeley Function Calling Leaderboard, Premier surpassed both Claude 3.5 Sonnet and Claude 3.7 Sonnet, though it fell short of [GPT-4](/wiki/gpt-4).5.

### How does Nova Premier compare with frontier models?

Independent evaluations found Premier trailing competitors on several harder tasks. On [SWE-bench](/wiki/swe_bench) Verified (a coding benchmark that tests a model's ability to resolve real GitHub issues), Premier achieved 42.4%.[4] While this exceeded GPT-4.5's score on the same benchmark, it fell short of both Claude Sonnet versions. Premier also scored lower than leading competitors on [GPQA Diamond](/wiki/gpqa_diamond) and [AIME 2025](/wiki/aime_2025), which test graduate-level science reasoning and math competition problems respectively. [Artificial Analysis](/wiki/artificial_analysis), an independent benchmarking organization, ranked Nova Premier 34th out of 63 models on its Intelligence Index, with a score of 19 out of a median of 22 for non-reasoning models.[17] The Nova family's main competitive advantage therefore lies in cost efficiency rather than raw intelligence on the most difficult evaluations.

## Architecture and training

Amazon published a technical report titled "The Amazon Nova Family of Models: Technical Report and Model Card" through Amazon Science in March 2025.[3] According to the report, Nova Micro, Lite, and Pro are all based on the [Transformer](/wiki/transformer) architecture. Amazon did not publicly disclose the parameter counts for any Nova model, nor did the report detail specific architectural modifications beyond the base Transformer design.[3]

### Pre-training

The models went through a multi-stage training process. [Pre-training](/wiki/pre-training) used a mixture of large-scale multilingual and multimodal data drawn from licensed data, proprietary data, open-source datasets, and publicly available data.[3] The training data covered over 200 languages, with particular emphasis on 15 languages: Arabic, Dutch, English, French, German, Hebrew, Hindi, Italian, Japanese, Korean, Portuguese, Russian, Simplified Chinese, Spanish, and Turkish. For the multimodal models (Lite, Pro, and Premier), the training data also included images and video alongside text.

### Post-training alignment

After pre-training, each model went through iterative fine-tuning stages.[3] The first stage was [Supervised Fine-Tuning](/wiki/fine_tuning) (SFT) on instruction-demonstration pairs, including multimodal examples for Lite and Pro. Next came reward model training from human preference data, where human annotators compared model outputs and indicated which responses were better. Finally, the models underwent preference learning using [Direct Preference Optimization](/wiki/direct_preference_optimization_dpo) (DPO) and [Proximal Policy Optimization](/wiki/reinforcement_learning_from_human_feedback) (PPO). These stages were applied iteratively, meaning the models went through multiple rounds of SFT, reward modeling, and preference optimization to progressively improve output quality.

### Training infrastructure

Training infrastructure relied on Amazon's custom [Trainium](/wiki/aws_trainium) chips (Trn1 instances), as well as [NVIDIA](/wiki/nvidia) A100 (P4d instances) and H100 (P5 instances) accelerators.[3] Distributed training was conducted on AWS SageMaker-managed Elastic Kubernetes Service (EKS) clusters, using Amazon FSx and [Amazon S3](/wiki/amazon_s3) for data storage and checkpoint I/O. The use of Amazon's own Trainium chips alongside NVIDIA GPUs reflects Amazon's broader strategy of developing custom silicon to reduce dependency on third-party chip suppliers and lower training costs.

## Benchmarks and performance

Amazon reported benchmark results for the Nova family in its technical report and launch blog posts.[1][3][6] The following table summarizes selected results from Amazon's own evaluations.

| Benchmark | Nova Micro | Nova Lite | Nova Pro | Nova Premier |
|---|---|---|---|---|
| MT-Bench (median) | 8.0 | 8.0 | 8.5 | 8.6 |
| [MMLU](/wiki/mmlu) | - | - | 85.9 | 87.4 |
| MMMU | - | - | - | 87.4 |
| SimpleQA | - | - | - | 86.3 |
| IFEval | - | - | - | 91.5% |
| SWE-bench Verified | - | - | - | 42.4% |
| HumanEval (pass@1) | - | - | 89.0 | - |
| Context window | 128K | 300K | 300K | 1M |

Note: Dashes indicate that specific scores were not disclosed by Amazon for that model-benchmark combination.

On [MT-Bench](/wiki/mt_bench), Amazon reported that Nova Premier had the most stable performance across evaluation categories, with a narrow min-max margin of 1.5 points, while Nova Pro showed greater variability with a min-max margin of 2.7 points.[6]

### How does Amazon Nova compare with competing models?

The following table compares Amazon Nova models against selected competing models based on publicly available data from Amazon's reports and third-party evaluations. Prices reflect on-demand rates as of 2025.

| Model | Provider | Modality | Context window | Input price (per 1M tokens) | Output price (per 1M tokens) | MMLU | SWE-bench Verified |
|---|---|---|---|---|---|---|---|
| Nova Micro | Amazon | Text | 128K | $0.035 | $0.14 | - | - |
| Nova Lite | Amazon | Multimodal | 300K | $0.06 | $0.24 | - | - |
| Nova Pro | Amazon | Multimodal | 300K | $0.80 | $3.20 | 85.9 | - |
| Nova Premier | Amazon | Multimodal | 1M | $2.50 | $12.50 | 87.4 | 42.4% |
| GPT-4o | [OpenAI](/wiki/openai) | Multimodal | 128K | $2.50 | $10.00 | 88.7 | - |
| [Claude 3.5 Sonnet](/wiki/claude) | [Anthropic](/wiki/anthropic) | Multimodal | 200K | $3.00 | $15.00 | 88.7 | 49.0% |
| [Gemini 1.5 Pro](/wiki/gemini) | [Google](/wiki/google_deepmind) | Multimodal | 2M | $1.25 | $5.00 | 85.9 | - |

Nova Pro and Nova Lite are notably cheaper than most competing models in their capability tiers. However, on harder benchmarks ([GPQA](/wiki/gpqa) Diamond, AIME 2025, [SWE-bench Verified](/wiki/swe-bench_verified)), Nova Premier trails the top models from Google, Anthropic, and OpenAI.[17] The Nova family's main competitive advantage lies in cost efficiency rather than raw intelligence on the most difficult evaluations.

## How much does Amazon Nova cost?

All Amazon Nova models are billed on a pay-as-you-go basis through Amazon Bedrock.[15] Pricing is calculated per 1,000 tokens for text models. Amazon offers three service tiers: Standard (regular rates), Priority (premium service with preferential compute allocation), and Flex (discounted pricing for workloads that tolerate delayed processing). Batch processing is available at a 50% discount from on-demand rates.[15]

### First-generation model pricing

| Model | Input (per 1M tokens) | Output (per 1M tokens) |
|---|---|---|
| Nova Micro | $0.035 | $0.14 |
| Nova Lite | $0.06 | $0.24 |
| Nova Pro | $0.80 | $3.20 |
| Nova Premier | $2.50 | $12.50 |

For cached prompts ([prompt caching](/wiki/prompt_engineering)), Nova Micro input tokens cost $0.00875 per million tokens, a 75% discount for repeated context.[15] This is useful for applications that reuse the same system prompt or reference documents across many requests.

### Creative content pricing

Nova Canvas image generation costs $0.04 per standard-quality image (up to 1024x1024) and $0.06 per premium-quality image.[15] Higher-resolution images (2048x2048) cost $0.06 (standard) and $0.08 (premium).

Nova Reel video generation costs approximately $0.08 per second of generated video at 1280x720 resolution.[15] A full 2-minute video generated with Reel 1.1 would therefore cost around $9.60.

## Amazon Nova Canvas

Nova Canvas is an image generation model that creates images from text prompts and image inputs.[1] It accepts English-language text prompts of up to 1,024 characters and produces images at resolutions up to 4.2 megapixels in any aspect ratio. The model competes with image generators such as [DALL-E](/wiki/dall_e) 3, [Stable Diffusion](/wiki/stable_diffusion), and [Midjourney](/wiki/midjourney).

Canvas supports several editing operations beyond basic text-to-image generation. Users can replace objects or backgrounds in existing images using text prompts, and a background removal feature is built in. The model can also be [fine-tuned](/wiki/fine_tuning) on proprietary data to generate images that match specific style guidelines or brand requirements. Amazon positions Canvas for use in advertising, marketing, and entertainment content creation.

For safety and traceability, Nova Canvas applies an invisible watermark to every generated image. Amazon developed this watermark to be "robust to alterations like rotation, resizing, color inversion, flipping, and other efforts to remove the watermark."[16] In addition, Canvas attaches C2PA (Coalition for Content Provenance and Authenticity) content credentials to generated images.[16] This metadata records the model, platform, and task type used, allowing downstream verification of image provenance. According to Amazon, the model's safety controls block 98.8% of potentially harmful prompts and 98.1% of toxic output content.[16] AWS also offers IP indemnification for Nova Canvas outputs, meaning Amazon assumes liability for intellectual property claims related to images generated by the model when used through Bedrock.

## Amazon Nova Reel

Nova Reel is a video generation model that produces short videos from text and image inputs.[1] At its initial launch in December 2024, Reel generated clips of up to 6 seconds at 1280x720 resolution and 24 frames per second. It competes with video generation models from [Runway](/wiki/runway_ml), [Pika](/wiki/pika), [Sora](/wiki/sora), and [Kling](/wiki/kling).

### Nova Reel 1.1

In April 2025, Amazon released Nova Reel 1.1, which extended the maximum video length to 2 minutes.[7] Videos are still composed of 6-second segments, but Reel 1.1 chains them together with style consistency across shots. The update introduced two generation modes:

- **Automated mode**: The user provides a single prompt of up to 4,000 characters, and the model automatically generates a multi-shot video reflecting that prompt.
- **Manual mode** (also called storyboard mode): The user specifies a separate prompt for each 6-second shot, with an optional starting image for each segment. This gives more direct control over shot composition and narrative structure.

Nova Reel also supports an image-to-video feature, where a reference image guides the generation of the video. Camera motion can be controlled through natural-language instructions in the prompt (for example, "slow pan left" or "zoom in on the subject"). Like Canvas, Reel includes built-in watermarking, content moderation, and C2PA content credentials.[16] Nova Reel 1.1 is available in the US East (N. Virginia) AWS Region through Amazon Bedrock.[7]

## Amazon Nova Sonic

Nova Sonic is a speech-to-speech foundation model announced on April 8, 2025.[8] Unlike traditional voice systems that chain together separate speech-to-text, language model, and text-to-speech components, Sonic is a unified model that processes speech input and generates speech output natively. Amazon describes its "novel architecture" as able to "adapt the intonation, prosody, and style of the generated speech response to align with the context and content of the speech input."[8] This reduces latency and enables more natural conversational dynamics, including real-time turn-taking. At launch, Sonic supported American and British English across various speaking styles and acoustic conditions, and it supports function calling and knowledge grounding via [retrieval-augmented generation](/wiki/retrieval_augmented_generation).[8]

### Nova 2 Sonic

At AWS re:Invent 2025 in December, Amazon announced Nova 2 Sonic with significant upgrades.[11] The updated model expanded language support to English, French, Italian, German, Spanish, Portuguese, and Hindi. Nova 2 Sonic introduced "polyglot voices," where a single voice (such as the Tiffany voice) can switch between all supported languages within one conversation without changing voice identity.

The model also improved its handling of alphanumeric inputs, short utterances, and telephony-quality (8 kHz) audio.[11] Asynchronous tool calling was added, allowing the model to continue responding to user input while tools execute in the background. This is particularly useful for call-center scenarios where the agent needs to look up account information while keeping the conversation going. Nova 2 Sonic integrates with Amazon Connect and third-party telephony providers including Vonage, Twilio, and AudioCodes, as well as open-source frameworks like LiveKit and Pipecat.

Pricing for Nova Sonic speech processing is $0.003 per 1,000 input tokens and $0.012 per 1,000 output tokens.[15]

## Amazon Nova Act

Nova Act is an AI model and SDK designed for browser automation.[9] It was first announced as a research preview on March 31, 2025, through nova.amazon.com, and reached general availability at AWS re:Invent in December 2025.[9][20] The underlying model is a custom variant of Nova 2 Lite, specifically trained for browser control.[18]

Nova Act is trained to perform actions within a web browser. The SDK allows developers to break complex workflows into atomic commands (for example: search for a product, complete a checkout, or answer questions about the current screen). Developers can also add detailed instructions to those commands, such as "don't accept the insurance upsell."[9] The SDK is installed with a single command (pip install nova-act) and agents are written in Python, combining natural-language instructions with code. The SDK supports interleaving Python logic (tests, breakpoints, asserts, thread pools for parallelization) with browser actions. For sensitive operations like password entry, developers can use direct [Playwright](/wiki/playwright)-based browser manipulation instead of the AI model.[9]

### How reliable is Nova Act?

Nova Act was trained with [reinforcement learning](/wiki/reinforcement_learning) and extensive in-domain browser interaction data. On ScreenSpot Web Text, a benchmark that measures how an AI agent interacts with text on a screen, Nova Act scored 94%, ahead of OpenAI's Computer-Using Agent (CUA) at 88% and Anthropic's Claude 3.7 Sonnet at 90%.[9] Amazon reports that the latest version demonstrates over 90% reliability across early enterprise workflows, including automated quality assurance, complex form handling, and process execution.[10]

At general availability, Nova Act integrates with AWS IAM for secure credentialing, Amazon S3 for data storage, and the Amazon Bedrock AgentCore Browser Tool for scalable cloud-based browser execution.[20] Agents can be deployed via Docker containers to Amazon ECR with automatic infrastructure setup. The service is priced at $4.75 per agent hour, billed for real-world elapsed time while agents work.[20] Human-in-the-loop wait time is excluded from billing. Parallel agents each generate separate charges.

## Fine-tuning and customization

Amazon provides several customization options for Nova models through Amazon Bedrock.[13]

### Fine-tuning

Nova Micro, Lite, and Pro all support fine-tuning with proprietary data.[13] Developers can fine-tune with text data (all three models) or multimodal data (Pro and Lite). Two fine-tuning approaches are available:

| Method | Description | Best for |
|---|---|---|
| Parameter-Efficient Fine-Tuning ([PEFT](/wiki/fine_tuning)) | Updates only a small subset of model parameters | Lightweight adaptation with limited training data |
| Full fine-tuning | Updates all model parameters | Extensive training datasets requiring maximum customization |

On-demand inference pricing for fine-tuned Nova models is the same as for the base models. Nova Canvas also supports fine-tuning for custom image styles.

### Model distillation

Amazon Bedrock supports [model distillation](/wiki/knowledge_distillation), where a larger "teacher" model transfers its knowledge to a smaller "student" model.[14] Nova Premier was designed specifically to serve as a teacher.[5] The process works as follows: the developer selects a teacher model (such as Premier) and a student model (such as Pro or Lite), provides use-case-specific prompts, and Bedrock generates responses from the teacher model for those prompts. It then fine-tunes the student model on the teacher's responses using automated data synthesis techniques.

Amazon reported that distilling Nova Pro using Nova Premier as the teacher model achieved 20% higher accuracy on tool selection and API calling tasks, while maintaining the speed and cost advantages of the smaller model.[5] This approach lets organizations get close to Premier-level quality for specific tasks at Pro-level pricing.

### Nova Forge

Announced at AWS re:Invent 2025, Nova Forge is an "open training" service that lets organizations build custom model variants starting from early Nova checkpoints.[18] Unlike standard fine-tuning, which adapts a finished model, Forge provides three checkpoints (pretrained, mid-trained, and post-trained) along with the recipes and code to continue training from any of them, allowing customers to influence the model's training from earlier stages. It supports API-based data mixing (blending proprietary data with the curated data used to train frontier models) and custom reward functions evaluated in proprietary environments.[18] Nova Forge targets specialized industries such as manufacturing, research and development, and media production. The service has a roughly $100,000 annual minimum subscription, with early customers including Booking.com, Cosine AI, Reddit, and Sony.[18]

## Amazon Nova 2 (second generation)

At AWS re:Invent 2025 on December 2, 2025, Amazon announced the Nova 2 model family.[18] The second generation includes four models.

| Model | Status | Context window | Key features |
|---|---|---|---|
| Nova 2 Lite | Generally available | 1M tokens | Cost-effective reasoning, extended thinking, built-in web grounding and code interpreter |
| Nova 2 Pro | Preview | 1M tokens | Complex multi-step tasks, multi-document analysis, video reasoning, software migrations |
| Nova 2 Sonic | Generally available | 1M tokens | Multilingual speech-to-speech, polyglot voices, telephony integration |
| Nova 2 Omni | Preview | 1M tokens | Text + image + video + speech input; text + image output; multi-speaker transcription |

A significant upgrade from the first generation is the expansion of context windows. Nova 2 Lite now supports 1 million tokens, up from 300,000 in the original Nova Lite.[18] All Nova 2 text and multimodal models support extended thinking with step-by-step reasoning and task decomposition. Three thinking intensity levels (low, medium, and high) give developers control over the trade-off between speed, intelligence, and cost. The models also include built-in tools such as a code interpreter and web grounding (the ability to search the web and incorporate results into responses), and they support remote MCP ([Model Context Protocol](/wiki/model_context_protocol)) tools.

Nova 2 Omni is particularly notable as what Amazon describes as the first reasoning model that accepts text, images, video, and speech inputs while generating both text and image outputs in a single model.[18] It supports over 200 languages and features character consistency and text rendering in generated images, along with multi-speaker transcription and native translation capabilities.

### Nova 2 pricing

| Model | Input (per 1K tokens) | Output (per 1K tokens) |
|---|---|---|
| Nova 2 Lite | $0.00125 | $0.0025 |
| Nova 2 Pro | $0.0003 (text/image/video/audio) | $0.01 (text) |
| Nova 2 Sonic (speech) | $0.003 | $0.012 |
| Nova 2 Sonic (text) | $0.00033 | $0.00275 |
| Nova 2 Omni (text/image/video input) | $0.0003 | $0.0025 (text), $0.04 (image) |

## How is Amazon Nova accessed?

All Amazon Nova models are accessed exclusively through [Amazon Bedrock](/wiki/amazon_bedrock), Amazon's fully managed service for foundation models.[1] Bedrock provides a unified API for invoking Nova models alongside third-party models from Anthropic, Meta, [Mistral](/wiki/mistral), [Cohere](/wiki/cohere), and others.

Through Bedrock, Nova models support several deployment and usage modes:

- **On-demand inference**: Pay-per-token pricing with no upfront commitments.
- **Batch inference**: Submit large sets of prompts for processing at a 50% discount from on-demand rates, with results delivered asynchronously.[15]
- **Provisioned throughput**: Reserve dedicated compute capacity for consistent, high-throughput workloads.
- **[Prompt](/wiki/prompt) caching**: Cache repeated context (such as system prompts or large documents) to reduce latency and cost on subsequent requests.
- **[Guardrails](/wiki/guardrails)**: Apply content filtering, PII detection, and topic-based blocking policies to model inputs and outputs.
- **Knowledge Bases**: Connect Nova models to enterprise data sources for [retrieval-augmented generation](/wiki/retrieval_augmented_generation) (RAG).
- **Agents**: Build multi-step agents that use Nova models to plan, call APIs, and execute tasks autonomously.

## Availability

Nova Micro, Lite, and Pro became generally available in Amazon Bedrock on December 3, 2024, initially in the US East (N. Virginia) AWS Region.[1] Nova Premier launched on April 30, 2025.[5] Nova Reel 1.1 became available in April 2025.[7] Nova Sonic launched on April 8, 2025, with Nova 2 Sonic following in December 2025.[8][11] Nova 2 Lite reached general availability in December 2025, while Nova 2 Pro and Nova 2 Omni remain in preview as of early 2026.[18]

Amazon has expanded regional availability over time, with additional AWS Regions gaining access based on demand.

## References

1. "Introducing Amazon Nova foundation models: Frontier intelligence and industry leading price performance." AWS News Blog, December 3, 2024. https://aws.amazon.com/blogs/aws/introducing-amazon-nova-frontier-intelligence-and-industry-leading-price-performance/
2. "Introducing Amazon Nova: A New Generation of Foundation Models." Amazon Press Center, December 3, 2024. https://press.aboutamazon.com/2024/12/introducing-amazon-nova-a-new-generation-of-foundation-models
3. "The Amazon Nova Family of Models: Technical Report and Model Card." Amazon Science, March 2025. https://www.amazon.science/publications/the-amazon-nova-family-of-models-technical-report-and-model-card
4. "Amazon Nova Premier: Technical Report and Model Card." Amazon Science, April 2025. https://www.amazon.science/publications/amazon-nova-premier-technical-report-and-model-card
5. "Amazon Nova Premier: Our most capable model for complex tasks and teacher for model distillation." AWS News Blog, April 30, 2025. https://aws.amazon.com/blogs/aws/amazon-nova-premier-our-most-capable-model-for-complex-tasks-and-teacher-for-model-distillation/
6. "Benchmarking Amazon Nova: A comprehensive analysis through MT-Bench and Arena-Hard-Auto." AWS Machine Learning Blog, 2025. https://aws.amazon.com/blogs/machine-learning/benchmarking-amazon-nova-a-comprehensive-analysis-through-mt-bench-and-arena-hard-auto/
7. "Amazon Nova Reel 1.1: Featuring up to 2-minutes multi-shot videos." AWS News Blog, April 2025. https://aws.amazon.com/blogs/aws/amazon-nova-reel-1-1-featuring-up-to-2-minutes-multi-shot-videos/
8. "Introducing Amazon Nova Sonic: Human-like voice conversations for generative AI applications." AWS News Blog, April 8, 2025. https://aws.amazon.com/blogs/aws/introducing-amazon-nova-sonic-human-like-voice-conversations-for-generative-ai-applications/
9. "Introducing Amazon Nova Act." Amazon AGI Labs, March 31, 2025. https://labs.amazon.science/blog/nova-act
10. "Amazon Nova Act SDK (preview): Path to production for browser automation agents." AWS Machine Learning Blog, 2025. https://aws.amazon.com/blogs/machine-learning/amazon-nova-act-sdk-preview-path-to-production-for-browser-automation-agents/
11. "Introducing Amazon Nova 2 Sonic: Our new speech-to-speech model for conversational AI." AWS News Blog, December 2025. https://aws.amazon.com/blogs/aws/introducing-amazon-nova-2-sonic-next-generation-speech-to-speech-model-for-conversational-ai/
12. "AWS re:Invent 2025: Every AI Announcement, Including Amazon Nova 2 and Kiro." Caylent Blog, December 2025. https://caylent.com/blog/aws-reinvent-2025-every-ai-announcement-including-amazon-nova-2-and-kiro
13. "Customizing Amazon Nova models." AWS Documentation. https://docs.aws.amazon.com/nova/latest/userguide/customization.html
14. "Distilling Amazon Nova models." AWS Documentation. https://docs.aws.amazon.com/nova/latest/userguide/customize-distill.html
15. "Amazon Bedrock Pricing." AWS. https://aws.amazon.com/bedrock/pricing/
16. "Amazon Nova and our commitment to responsible AI." Amazon Science, 2024. https://www.amazon.science/blog/amazon-nova-and-our-commitment-to-responsible-ai
17. "Amazon launches Nova Premier, its most capable AI model yet." TechCrunch, April 30, 2025. https://techcrunch.com/2025/04/30/amazon-launches-nova-premier-its-largest-ai-model-yet/
18. "Amazon introduces new frontier Nova models, a pioneering Nova Forge service, and Nova Act for building agents." About Amazon, December 2, 2025. https://www.aboutamazon.com/news/aws/aws-agentic-ai-amazon-bedrock-nova-models
19. "Amazon Nova Foundation Models: Comparison with OpenAI and Claude Model Family." Bind AI Blog, December 2024. https://blog.getbind.co/2024/12/10/amazon-nova-foundation-models-comparison-with-openai-and-claude-model-family/
20. "Build reliable AI agents for UI workflow automation with Amazon Nova Act, now generally available." AWS News Blog, December 2025. https://aws.amazon.com/blogs/aws/build-reliable-ai-agents-for-ui-workflow-automation-with-amazon-nova-act-now-generally-available/
21. "Amazon Nova: Meet our new foundation models in Amazon Bedrock." About Amazon, December 3, 2024. https://www.aboutamazon.com/news/aws/amazon-nova-artificial-intelligence-bedrock-aws

