AgentsX
Posts
Silicon Valley Invests Heavily in Simulated Environments

Silicon Valley Invests Heavily in Simulated Environments

Silicon Valley Bets Big on ‘Environments’ to Train Next-Gen AI Agents.

Fred
September 22, 2025

What’s trending?

Training AI in Virtual Worlds: Silicon Valley’s New Bet
The AI Push Streamlining Loan Recovery for Major Banks
$15M Boost for AI That Learns on Its Own

The AI Agent Race Fuels a Gold Rush in Training Environment Technology

The next big leap in AI development, specifically for creating capable "AI agents" that can use software autonomously, hinges on a technique called reinforcement learning (RL) environments.

These are simulated workspaces (like a "boring video game") where AI agents can practice multi-step tasks, such as buying socks online, and learn from their mistakes. This is seen as the successor to the static, labeled datasets that powered the previous generation of AI chatbots.

The Rising Demand and the "Gold Rush"

There is a massive and growing demand for these RL environments from major AI labs like OpenAI, Anthropic, and Google. This demand has created a gold rush, leading to two types of companies entering the space:

Established Data-Labeling Firms: Companies like Scale AI, Surge, and Mercor are pivoting from labeling data to building RL environments to stay relevant.
New, Well-Funded Startups: Startups like Mechanize and Prime Intellect are focusing exclusively on this new field, with Mechanize aiming for a few high-quality environments and Prime Intellect creating an open-source platform for wider developers.

The hope for investors is that one of these companies becomes the next "Scale AI" but for RL environments.

How RL Environments Work?

An RL environment is a training simulation. For example, an agent might be tasked with completing a purchase on Amazon. The agent is rewarded when it succeeds and receives feedback when it fails.

The challenge is that these environments must be incredibly robust to handle the unpredictable ways an AI might fail, making them more complex to build than simple datasets.

While the concept isn't new (e.g., DeepMind's AlphaGo used RL in 2016), the current goal is different: to create generally capable AI agents using modern transformer models, which is a much more complicated undertaking.

Skepticism and Challenges

Despite the excitement, there are significant doubts about whether this approach will scale effectively. Key challenges include:

Reward Hacking: AI models are prone to finding shortcuts to get the reward without actually completing the task correctly.
Technical Difficulty: Building high-quality, functional environments is extremely hard, and many available ones require heavy modification.
Rapidly Changing Field: The fast pace of AI research means that today's cutting-edge technique could be obsolete tomorrow.
Broader Skepticism on RL: Some experts, like influential AI researcher Andrej Karpathy, are bullish on environments but bearish on reinforcement learning itself, questioning how much more progress it can yield.

In summary, while RL environments are currently the focal point of investment and research for developing advanced AI agents, it remains an open question whether this technique will successfully drive the next major wave of AI progress.

Major Banks Deploy AI Virtual Agents to Revolutionize Loan Recovery

Indian banks and non-banking financial companies (NBFCs) are increasingly using artificial intelligence (AI) to recover overdue loan payments.

Instead of a call from a human agent, customers who miss repayments may now receive a video call from an AI-generated avatar, often designed to look like a lawyer or a professional in formal attire.

Why AI is Being Adopted?

Financial institutions are making this shift for two main reasons:

Cost-Effectiveness: A human recovery agent handles about 250 cases per month at a significant salary cost. In contrast, an AI system can make up to 20 times more calls and is 40-60% cheaper to operate.
Efficiency and Effectiveness: AI avatars can work continuously without breaks, leading to more contact with borrowers. They are programmed to be firm, use legal language, and insist on repayment, but are designed to remain polite and avoid the harassment risks sometimes associated with human agents. This approach has shown an 80-85% success rate for recent defaults (up to 30 days overdue), particularly on personal and vehicle loans.

Current Implementation and Future Adoption

Currently, most private banks like ICICI Bank, HDFC Bank, and Yes Bank, along with NBFCs like L&T Finance, are using these AI tools from specialized service providers such as Credgenics, Oriserve, and Sarvam AI.

“AI is able to do fraud detection around payments by analyzing millions of data points at a faster rate than any human can, which is why many banks use AI for payments and fraud detection”, Ladi Asuni, Partner & Head; Tech Platforms, KPMG West Africa.
(1/2)
— KPMG Nigeria (@KPMG_NG)
10:29 AM • Sep 19, 2025

They typically use a combined approach: AI-powered video calls and messages are the first line of contact, followed by human agents for more difficult cases.

While public sector banks (PSBs) have not yet deployed this technology, they are beginning to explore it, both individually and through the PSB Alliance.

Adherence to Regulations

Lenders emphasize that these AI systems are programmed to strictly follow the Reserve Bank of India's (RBI) guidelines for debt collection.

This includes rules such as not contacting borrowers before 8 am or after 7 pm, and avoiding any form of intimidation, harassment, or invasion of privacy. This ensures the AI agents operate within legal and ethical boundaries.

Envive AI Raises $15 Million Series A to Power Self-Improving Agents

Envive AI, a company developing intelligent technology for digital commerce, has secured $15 million in a Series A funding round led by Fuse VC. This investment brings its total funding to $20 million.

The company specializes in creating safe, self-improving AI agents that help brands enhance every customer interaction, leading to improved business results and scalable growth.

This innovation addresses a major shift in consumer behavior. A McKinsey study reveals that over 25% of Gen Z shoppers already use AI for purchase decisions, a trend expected to grow. To adapt, brands must adopt an "agentic" approach.

Envive AI raises $15M in Series A funding led by Fuse VC to enhance digital commerce with self-improving AI agents!
@fuse_vc
— Uncommon Raises (@uncommonraises)
6:08 PM • Sep 17, 2025

Envive enables this with a reinforcement learning-based intelligence layer that learns from user behavior and coordinates multiple AI agents to achieve brand objectives throughout the customer journey.

Envive's system collects data from every touchpoint—from initial product discovery to post-purchase support. It uses this real-time information to ensure its AI agents collaborate, sharing context and adapting strategies to optimize each stage of the journey. As underlying AI models advance, this technology will allow brands to handle increasingly complex tasks.

Already, Envive has driven significant results for brands like Spanx and Supergoop!, improving website conversion rates, visibility in AI-powered search, and customer retention. The company is projected to grow fivefold by 2025.

Supported by investors like Fuse VC and Point72 Ventures, Envive's team combines deep expertise in AI and commerce, including a CEO who previously led generative AI initiatives at Walmart.

Cameron Borumand, General Partner at FUSE: “Shoppers are tired of endlessly scrolling websites. Envive AI enables brands to finally offer customers the better online experience they deserve.”
Aniket Deosthali, CEO of Envive AI: “Commerce is becoming agentic. Brands need more than basic AI tools; they need a system that learns from real-world behavior to drive results. We are building self-improving agents focused on performance and safety.”
Aniket Deosthali, on the company's mission: “Our goal is to create agents that learn and deliver outcomes. We are just beginning, but the future is already here.”

Stay with us. We drop insights, hacks, and tips to keep you ahead. No fluff. Just real ways to sharpen your edge.

What’s next? Break limits. Experiment. See how AI changes the game.

Till next time - keep chasing big ideas.

What's your take on our newsletter?

Thank you for reading