Amazon Nova Act SDK (preview): Path to production for browser automation agents

Discover how Amazon Nova Act SDK enables reliable, scalable browser automation agents with AWS integrations for enterprise workflows.

July 28, 2025
Amazon Web Services

In early 2025, Amazon introduced the Amazon Nova Act SDK as a research preview to help developers build agents that reliably complete tasks in a web browser. Now, in a limited preview, customers can take their agents to production with new AWS integrations including AWS Identity and Access Management (IAM) for secure credentialing, Amazon Simple Storage Service (Amazon S3) for data storage and policy control, and the new Amazon Bedrock AgentCore Browser Tool for scalable, cloud-based browser execution. This article explores what makes the Amazon Nova Act SDK unique, how it works, and how teams across industries are already using it to automate browser-based workflows at scale.

Challenges with Traditional Automated Business Workflows

Many business operations require browser interaction, such as submitting time-off requests, processing invoices, accessing vendor portals, or reviewing dashboards. Because API coverage is limited, these workflows are often manual, involving repetitive copy-pasting, multi-step flows, and navigation of complex interfaces.

Traditional rules-based browser automation frameworks struggle in dynamic web environments. Changes in page structure, such as new form fields or dropdown options, often break brittle selectors, leading to high maintenance overhead. These frameworks also scale poorly: verifying professional licenses across 50 different state websites, for example, requires building 50 unique automations.

Humans adapt readily across different tools and interfaces; drafting an email in Outlook, for instance, carries over directly to Gmail. Large language models (LLMs), trained on millions of UI examples, offer similar pattern recognition for AI agents. These models have powered chatbots, summarization, and coding copilots by understanding language and reasoning across domains. The next phase, agentic AI, extends LLMs from understanding to execution, enabling completion of dynamic, multi-step workflows such as filling complex forms or interacting with evolving UIs at scale.
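To make the brittle-selector problem concrete, here is a minimal sketch using only the Python standard library. The two page snippets and the field ids are hypothetical, invented for illustration; they stand for the same form before and after a site redesign.

```python
from html.parser import HTMLParser


class InputIdCollector(HTMLParser):
    """Collects the id attribute of every <input> element on a page."""

    def __init__(self) -> None:
        super().__init__()
        self.input_ids: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "input":
            attr_map = dict(attrs)
            if "id" in attr_map:
                self.input_ids.append(attr_map["id"])


def has_field(page_html: str, field_id: str) -> bool:
    """Rule-based check: does the page contain an input with exactly this id?"""
    collector = InputIdCollector()
    collector.feed(page_html)
    return field_id in collector.input_ids


# Hypothetical markup for the same form before and after a redesign.
PAGE_V1 = '<form><input id="license-number" type="text"></form>'
PAGE_V2 = '<form><input id="licenseNo" type="text"></form>'

if __name__ == "__main__":
    print(has_field(PAGE_V1, "license-number"))  # rule written for v1 matches
    print(has_field(PAGE_V2, "license-number"))  # same rule misses after the rename
```

The field is still on the redesigned page, but a rule keyed to the old id no longer finds it. A person reading the form would not be thrown by the rename, which is the adaptability the article attributes to LLM-based agents.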

Agentic AI with the Amazon Nova Act SDK

The Amazon Nova Act SDK enables building and deploying reliable browser agents powered by the Amazon Nova Act model, which is fine-tuned for agentic AI. Trained with reinforcement learning and extensive browser interaction data, the model executes workflows step by step. The SDK supports installation with a single command, writing agents in Python and natural language, real-time debugging, and integration into continuous integration and delivery (CI/CD) pipelines. With enterprise-grade security, observability, and infrastructure through AWS, it offers a fast, flexible path from prototype to production.

The SDK integrates with the Amazon Bedrock AgentCore Browser Tool, a fast, secure, cloud-based browser that lets AI agents interact with websites at scale. Features include virtual machine-level isolation, federated identity integration, live viewing, AWS CloudTrail logging, and session replay for troubleshooting and compliance.

Benefits of the Amazon Nova Act SDK

Reliability: Build Robust Browser Automation

Developers can decompose complex workflows into atomic commands (e.g., collecting all form elements and returning required fields). The SDK allows detailed instructions (e.g., dismiss popups), API calls, and direct browser manipulation via Playwright for sensitive tasks like password entry. Python code can be interleaved for tests, breakpoints, assertions, and parallelization. Early enterprise workflows demonstrate over 90% reliability, including automated quality assurance, complex form handling, and process execution. Enhanced reasoning and recovery enable agents to adapt to UI changes and complete sequences accurately.
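The decomposition described above might look roughly like the following sketch. It deliberately does not use the SDK itself: `BrowserAgent`, its `act` method, and `StubAgent` are hypothetical stand-ins so that the example runs offline, and the instruction strings are invented for illustration. The point is the shape of the workflow: small natural-language steps with ordinary Python assertions between them.

```python
from dataclasses import dataclass, field
from typing import Protocol


class BrowserAgent(Protocol):
    """Hypothetical interface: one natural-language instruction in, text out."""

    def act(self, instruction: str) -> str: ...


@dataclass
class StubAgent:
    """Stand-in agent (not the SDK): records instructions, returns canned text."""

    canned: dict[str, str]
    log: list[str] = field(default_factory=list)

    def act(self, instruction: str) -> str:
        self.log.append(instruction)
        return self.canned.get(instruction, "ok")


def submit_time_off(agent: BrowserAgent, start: str, end: str) -> str:
    """Run a time-off request as small steps with a Python check in between."""
    agent.act("dismiss any popups")
    required = agent.act("list the required form fields")
    # Plain Python assertion interleaved between agent steps.
    assert "start date" in required, f"unexpected form layout: {required}"
    agent.act(f"enter {start} as the start date")
    agent.act(f"enter {end} as the end date")
    return agent.act("submit the form and return the confirmation text")


if __name__ == "__main__":
    demo = StubAgent(canned={
        "list the required form fields": "start date, end date",
        "submit the form and return the confirmation text": "Request submitted",
    })
    print(submit_time_off(demo, "2025-08-04", "2025-08-08"))
```

Because each step is a separate call, a failed assertion points at the step that went wrong, and a breakpoint can be placed between any two steps.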

Speed-to-Market: Prototype to Production in Days

The SDK enables rapid automation development without brittle scripts. Features include:
  • Git-based CI/CD pipelines
  • Reusable agent modules
  • Playwright fallbacks for sensitive tasks
  • Threading and parallelization for speed

Agents fit into existing developer workflows, enabling a quick transition from experimentation to production.
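As an illustration of the threading and parallelization point, the sketch below fans one check out across several sites with a standard-library thread pool. `verify_license` is a hypothetical placeholder for one agent session against one site; it fakes its result so the example runs without a browser. The site names and license id are likewise invented.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable


def verify_license(site: str, license_id: str) -> bool:
    """Hypothetical placeholder for one agent session against one site.

    Fakes a result (id prefix matches the site's first two letters)
    so the sketch runs offline.
    """
    return license_id.startswith(site[:2].upper())


def verify_everywhere(
    sites: list[str],
    license_id: str,
    check: Callable[[str, str], bool] = verify_license,
    max_workers: int = 4,
) -> dict[str, bool]:
    """Run the same check against every site concurrently using a thread pool."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() yields results in the same order as the input sites.
        results = pool.map(lambda site: check(site, license_id), sites)
        return dict(zip(sites, results))


if __name__ == "__main__":
    print(verify_everywhere(["california", "texas", "ohio"], "CA-123"))
```

Threads suit this kind of work because each session spends most of its time waiting on the network and the browser rather than on the CPU.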

    Security: Enterprise-Grade Automation

    IAM integration controls access to the model in the same way as for other AWS services. The SDK supports macOS, Linux, Windows, and WSL2, with isolated runtime environments and encryption for sensitive data. It is designed for enterprise reliability, observability, and security.

    See It in Action: Streamlining Financial Decisions

    In financial services, speed and accuracy in turning fragmented public data into insights are critical. The Amazon Nova Act SDK automates information gathering to support these needs.

    Where the Amazon Nova Act SDK Can Make an Impact

    Browser-based workflows are often manual and error-prone. The SDK automates these tasks, improving accuracy and freeing teams for higher-value work. Early customers are applying it across industries:

    Automated Data Entry and Form Filling

    Automates repetitive input across CRMs, HR tools, and finance platforms. For example, healthcare staff assist members with complex benefit applications, and public sector caseworkers re-enter household data across systems. The SDK navigates dynamic fields and popups reliably. Rackspace Technology and Alvee Health use the SDK to automate public benefits registration, expecting a 30% increase in successful registrations and up to tenfold improvement in caseload management.

    Customer Support Augmentation

    Automates backend browser tasks across retail, travel, and SaaS support workflows, allowing agents to focus on customer interactions. Navan uses the SDK to automate payment form filling across diverse vendors, reducing repetitive tasks and scaling operations efficiently.

    Automating High-Stakes Administrative Workflows

    Automates credential verification, identity checks, and compliance tasks across numerous third-party portals with inconsistent layouts. Automation Anywhere integrates the SDK into its Process Reasoning Engine to automate professional credential verification, enabling AI agents to reason through UI processes with human-like expertise.

    UX and QA Testing Across Dynamic Interfaces

    Simulates real user interactions on frequently changing websites. Teams write and update tests using natural language or Python, adapting quickly without brittle selectors. Tyler Technologies automates software testing with the SDK, converting manual test plans into automated suites rapidly, saving hundreds of hours and improving product quality.
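One way a manual test plan could be turned into an automated suite is sketched below with Python's built-in `unittest`. Everything specific here is an assumption made for illustration: the plan rows, the `FAKE_SITE` answers, and `run_step`, which stands in for a call to a browser agent and returns canned text so the suite runs without a browser.

```python
import unittest

# Hypothetical manual test plan: (instruction for the agent, expected answer).
TEST_PLAN = [
    ("open the login page and report the page title", "Sign in"),
    ("log in as the demo user and report the greeting", "Welcome, demo"),
]

# Canned responses standing in for what a real site would show.
FAKE_SITE = {
    "open the login page and report the page title": "Sign in",
    "log in as the demo user and report the greeting": "Welcome, demo",
}


def run_step(instruction: str) -> str:
    """Hypothetical placeholder for an agent call; looks up a canned answer."""
    return FAKE_SITE[instruction]


class PlanDrivenTests(unittest.TestCase):
    def test_plan(self):
        for instruction, expected in TEST_PLAN:
            # subTest reports each plan row separately if it fails.
            with self.subTest(step=instruction):
                self.assertEqual(run_step(instruction), expected)
```

Saved as a module, this runs with `python -m unittest`. Adding a case means adding a row to `TEST_PLAN` in plain language rather than writing a new selector.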

    What’s Next for the Amazon Nova Act SDK

    Amazon is working with early customers to expand the SDK’s capabilities beyond browser workflows, applying reinforcement learning to more complex real-world tasks and deepening AWS ecosystem integration for faster prototyping and deployment.

    Get Started with the Amazon Nova Act SDK

    Technical leaders and developers can start prototyping with the research preview at Amazon Nova Act. The Amazon AGI Lab focuses on building AI agents capable of real-world actions across digital and physical environments. Learn more at Amazon AGI Labs. Contact AWS to express interest in productionizing your agent (limited preview).

    About the Authors

    Lori Knapp is a Principal Product Manager with Amazon Nova, leading product efforts to define how foundation models power intelligent agents across diverse real-world use cases. Her background includes Alexa voice experiences, Microsoft product strategy, and behavioral science consulting.

    Tara Raj is an Engineering Manager at Amazon working on Nova Act, focusing on developer experience and adoption. She has over 10 years of engineering experience, including work on Nova multimodal capabilities, Windows Subsystem for Linux, and Visual Studio Code.

    Resources

  • Getting Started with AWS
  • What's New at AWS

  • Originally published at Amazon Web Services Blog on Mon, 28 Jul 2025.

    Frequently Asked Questions (FAQ)

    Understanding the Amazon Nova Act SDK

    Q: What is the primary purpose of the Amazon Nova Act SDK?
    A: The Amazon Nova Act SDK is designed to help developers build agents that can reliably complete tasks within a web browser. It provides tools and integrations for taking browser automation agents from prototype to production.

    Q: What are the key AWS integrations offered with the SDK?
    A: The SDK integrates with AWS Identity and Access Management (IAM) for secure credentialing, Amazon Simple Storage Service (Amazon S3) for data storage and policy control, and the Amazon Bedrock AgentCore Browser Tool for scalable, cloud-based browser execution.

    Q: How does the Nova Act model differ from traditional automation frameworks?
    A: Unlike traditional frameworks that rely on brittle selectors and struggle with dynamic web environments, the Nova Act model leverages large language models (LLMs) trained on extensive browser interaction data, enabling it to understand and execute complex, multi-step workflows with greater adaptability and precision.

    Benefits and Features

    Q: What are the main benefits of using the Amazon Nova Act SDK?
    A: The key benefits include enhanced reliability through agent reasoning and recovery, faster speed-to-market with rapid prototyping and CI/CD integration, and enterprise-grade security with IAM integration and isolated runtime environments.

    Q: How does the SDK ensure reliability in browser automation?
    A: Developers can decompose complex workflows into atomic commands, provide detailed instructions, and interleave Python code for tests, assertions, and parallelization. The model's enhanced reasoning and recovery capabilities allow agents to adapt to UI changes.

    Q: How does the SDK facilitate speed-to-market?
    A: Features like Git-based CI/CD pipelines, reusable agent modules, Playwright fallbacks, and threading/parallelization enable rapid development and quick transition from experimentation to production.

    Q: What security features are included?
    A: The SDK offers enterprise-grade security through IAM integration for access control, isolated runtime environments for macOS, Linux, and Windows, and data encryption for sensitive information.

    Use Cases and Applications

    Q: In which industries can the Amazon Nova Act SDK be applied?
    A: The SDK is applicable across various industries, including financial services, healthcare, public sector, retail, travel, SaaS, and any sector with browser-based workflows.

    Q: What types of browser-based workflows can be automated?
    A: The SDK can automate tasks such as data entry and form filling, invoice processing, accessing vendor portals, reviewing dashboards, credential verification, identity checks, compliance tasks, and UX/QA testing.

    Q: Can you provide examples of real-world applications?
    A: Examples include automating public benefits registration (Rackspace Technology, Alvee Health), automating payment form filling (Navan), professional credential verification (Automation Anywhere), and automating software testing (Tyler Technologies).

    Getting Started and Future Development

    Q: How can developers get started with the Amazon Nova Act SDK?
    A: Developers can start prototyping with the research preview available at Amazon Nova Act. For production use, interested customers should contact AWS.

    Q: What is the future direction for the Amazon Nova Act SDK?
    A: Amazon is working with early customers to expand its capabilities beyond browser workflows, applying reinforcement learning to more complex real-world tasks and deepening AWS ecosystem integration.

    Crypto Market's Take

    The advent of sophisticated tools like the Amazon Nova Act SDK signifies a pivotal shift in how businesses approach automation, particularly in dynamic environments like web browsers. This advancement mirrors the growing trend in the crypto space where AI is increasingly being integrated to enhance trading strategies, market analysis, and operational efficiency. At AI Crypto Market, we are at the forefront of this convergence, developing AI-driven solutions that leverage advanced machine learning for tasks ranging from automated trading bot execution to sophisticated market intelligence and portfolio management. Our platform aims to equip users with the tools to navigate the complexities of the digital asset landscape, much like the Nova Act SDK empowers developers to streamline browser-based workflows. The parallel between these technological advancements underscores the broader impact of AI in automating complex processes across various industries.
