AI Market Logo
BTC $43,552.88 -0.46%
ETH $2,637.32 +1.23%
BNB $312.45 +0.87%
SOL $92.40 +1.16%
XRP $0.5234 -0.32%
ADA $0.8004 +3.54%
AVAX $32.11 +1.93%
DOT $19.37 -1.45%
MATIC $0.8923 +2.67%
LINK $14.56 +0.94%
HAIA $0.1250 +2.15%
BTC $43,552.88 -0.46%
ETH $2,637.32 +1.23%
BNB $312.45 +0.87%
SOL $92.40 +1.16%
XRP $0.5234 -0.32%
ADA $0.8004 +3.54%
AVAX $32.11 +1.93%
DOT $19.37 -1.45%
MATIC $0.8923 +2.67%
LINK $14.56 +0.94%
HAIA $0.1250 +2.15%
From OpenAI to Nvidia, researchers agree: AI agents have a long way to go
AI-agents

From OpenAI to Nvidia, researchers agree: AI agents have a long way to go

Top AI researchers agree AI agents hold promise but face major challenges before becoming reliable and autonomous in real-world tasks.

August 6, 2025
5 min read
Sharon Goldman

Top AI researchers agree AI agents hold promise but face major challenges before becoming reliable and autonomous in real-world tasks.

From OpenAI to Nvidia: Why AI Agents Still Have a Long Road Ahead

Only in the Bay Area does spending a Saturday geeking out about AI agents—alongside 2,000 students, researchers, and tech insiders crammed into UC Berkeley—feel like a totally normal weekend plan. At the day-long Agentic AI Summit, the atmosphere was less like an academic conference and more like Silicon Valley’s buzzy hotspot. The speaker lineup was impressive, featuring top AI researchers and scientists including Jakob Pachocki, chief scientist at OpenAI; Ed Chi, VP of research at Google DeepMind; Bill Dally, chief scientist at Nvidia; Ion Stoica, cofounder at Databricks & Anyscale and UC Berkeley professor; and Dawn Song, a pioneering UC Berkeley professor focused on AI security. The summit’s focus was AI agents—AI-powered systems that can complete tasks mostly autonomously by using other software tools. Imagine a chatbot that not only suggests a vacation itinerary but also books flights and hotel reservations. As my colleague Jeremy Kahn recently noted, this kind of automation has long been a C-suite dream. Over the past decade, companies embraced robotic process automation (RPA), software that automates repetitive tasks like cutting and pasting between databases. However, traditional RPA is inflexible and limited to narrow tasks. Agentic AI aims to be more flexible and powerful, adapting to various business needs. OpenAI CEO Sam Altman expressed optimism in a January 2025 blog post, stating, “We believe that, in 2025, we may see the first AI agents ‘join the workforce’ and materially change the output of companies.” Despite this hype, the overall message at the summit was cautious and grounded. AI agents remain far from fully reliable or autonomous. They often struggle to remember past interactions and can fail in real-world environments. Google DeepMind’s Ed Chi emphasized the gap between polished demos and what is needed for production-ready agents. Jakob Pachocki raised concerns about the safety, security, and trustworthiness of agentic systems, especially when deployed in sensitive or autonomous settings. Sherwin Wu, head of engineering at OpenAI API, admitted, “I still don’t think agents have really lived up to their promise. Certain more generic cases have worked, but my day-to-day work doesn’t really feel that different with agents.” While today’s AI agents don’t yet meet the massive hype—contrasting with claims like Salesforce CEO Marc Benioff’s recent statement that he might be the “last CEO of Salesforce who only managed humans”—there was still plenty of optimism at the summit. Ion Stoica highlighted infrastructure improvements that simplify building agentic systems. Bill Dally pointed to ongoing hardware advances that will enable more powerful and efficient agent behaviors. Several speakers noted “narrow wins” in specific domains such as coding assistance. Though AI agents face growing pains, the packed UC Berkeley ballroom showed the industry’s commitment to advancing these systems. The ultimate goal remains clear: AI agents capable of reliably operating in the real world. The payoff, researchers believe, will be worth the wait.

AI in the News

  • U.S. agency approves OpenAI, Google, Anthropic for federal AI vendor list. The General Services Administration added OpenAI's ChatGPT, Google's Gemini, and Anthropic's Claude to an approved AI vendor list to accelerate government adoption. Providers commit to responsible use and compliance with federal standards.
  • The AI spending boom’s impact on the U.S. economy. Big Tech’s $350 billion AI investment in 2025 is fueling data center construction and chip demand, potentially boosting GDP growth by up to 0.7%. Economists warn of risks if the AI boom slows.
  • AI sales tool Clay raises $100 million at a $3.1 billion valuation. Clay helps sales reps find leads and convert customers. The funding round was led by CapitalG, Alphabet’s investment arm.
  • Eye on AI Research

  • Google DeepMind’s Genie 3 creates real-time interactive simulations. Genie 3 generates dynamic virtual worlds from text prompts, allowing navigation and interaction in AI-generated environments. Access is currently limited to select researchers as responsible deployment is explored.

  • This article was originally published at Fortune on August 5, 2025.

    Frequently Asked Questions (FAQ)

    AI Agents and Their Capabilities

    Q: What exactly are AI agents as discussed in the article? A: AI agents are AI-powered systems designed to complete tasks largely autonomously by utilizing other software tools. They aim to go beyond simple task execution to a more proactive and integrated approach. Q: What are the current limitations of AI agents? A: Despite advancements, AI agents currently struggle with reliability and full autonomy. They often have difficulty remembering past interactions and can fail in real-world, complex environments. Safety, security, and trustworthiness are also significant concerns, especially for autonomous applications. Q: What is the difference between traditional RPA and agentic AI? A: Traditional Robotic Process Automation (RPA) automates repetitive, narrow tasks with limited flexibility. Agentic AI aims to be more flexible and powerful, adapting to a wider range of business needs and operating with a greater degree of autonomy. Q: What are some examples of potential applications for AI agents? A: An example provided is a chatbot that can not only suggest a vacation itinerary but also handle the booking of flights and hotel reservations. In a business context, they could automate complex workflows and client interactions.

    Industry Perspectives and Future Outlook

    Q: What was the overall sentiment at the Agentic AI Summit? A: While there was optimism about the future potential of AI agents, the prevailing message was cautious and grounded, acknowledging the significant challenges that still need to be overcome before widespread, reliable deployment. Q: What advancements are helping AI agents progress? A: Key advancements include infrastructure improvements that simplify the building of agentic systems and hardware progress that enhances the power and efficiency of agent behaviors. Specific domains like coding assistance have seen "narrow wins." Q: How might AI agents impact the workforce? A: OpenAI CEO Sam Altman believes that by 2025, AI agents could "join the workforce" and materially change company output. However, the current reality, as noted by OpenAI's Sherwin Wu, is that their day-to-day impact is not yet drastically different for many.

    Crypto Market AI's Take

    The discussion around AI agents highlights a critical phase in AI development, mirroring the evolution seen in the broader tech and finance sectors. Just as AI is being integrated into financial tools to enhance analysis and trading, the development of sophisticated AI agents promises to revolutionize how tasks are performed. At Crypto Market AI, we are actively exploring how these advancements can be applied to the cryptocurrency space, from automating complex trading strategies with our AI-powered trading bots to providing sophisticated market analysis. Our focus remains on leveraging AI to augment human capabilities, ensuring that these powerful tools are used responsibly and effectively, much like the cautious optimism expressed by experts at the Agentic AI Summit. For those interested in understanding the intersection of AI and finance, our insights into AI-driven crypto trading tools and market analysis provide a glimpse into this evolving landscape.

    More to Read:

  • AI Agents: Capabilities, Risks, and the Growing Role
  • The Rise of AI in Cryptocurrency Trading
  • Understanding AI-Powered Crypto Scams