New AWS service tackles AI hallucinations

By TechBrunch | December 3, 2024 | 4 Mins Read


Amazon Web Services (AWS), Amazon's cloud computing division, is launching a new tool to combat hallucinations, that is, scenarios in which an AI model behaves unreliably.

The service, Automated Reasoning checks, announced at AWS's re:Invent 2024 conference in Las Vegas, validates a model's responses by cross-referencing them against customer-supplied information. AWS claims in a press release that Automated Reasoning checks is the “first and only” safeguard against hallucinations.

But that is, well, putting it generously.

Automated Reasoning checks is similar to the Correction feature Microsoft introduced this summer, which also flags AI-generated text that may be factually incorrect. Google also offers a tool in Vertex AI, its AI development platform, that lets customers “ground” models using data from third-party providers, their own datasets, or Google Search.

In any case, Automated Reasoning checks, which is available through AWS's Bedrock model hosting service (specifically the Guardrails tool), attempts to figure out how a model arrived at an answer and to determine whether that answer is correct. Customers upload information to establish a ground truth of sorts, and Automated Reasoning checks creates rules that can then be refined and applied to a model.

As a model generates responses, Automated Reasoning checks verifies them and, in the event of a probable hallucination, draws on the ground truth for the correct answer. It presents this answer alongside the likely mistruth so customers can see how far off the mark the model may have been.
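The flow described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not AWS's implementation or the Bedrock API: the `Rule` and `Finding` classes, the `check_response` function, and the made-up leave policy are all hypothetical, and the step that extracts structured facts from a model's answer is assumed to exist.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Rule:
    """A hypothetical rule derived from customer-provided ground truth."""
    name: str
    applies: Callable[[dict], bool]     # does this rule cover the claim?
    expected: Callable[[dict], object]  # value the ground truth implies
    field: str                          # which field of the claim to verify


@dataclass
class Finding:
    rule: str
    claimed: object
    expected: object


def check_response(claim: dict, rules: list[Rule]) -> list[Finding]:
    """Return one finding per applicable rule that the claim contradicts."""
    findings = []
    for rule in rules:
        if not rule.applies(claim):
            continue
        expected = rule.expected(claim)
        claimed = claim.get(rule.field)
        if claimed != expected:
            findings.append(Finding(rule.name, claimed, expected))
    return findings


# Made-up ground truth: full-time staff get 20 days of leave, part-time get 10.
RULES = [
    Rule("full_time_leave", lambda c: c["employment"] == "full_time",
         lambda c: 20, "leave_days"),
    Rule("part_time_leave", lambda c: c["employment"] == "part_time",
         lambda c: 10, "leave_days"),
]

# Structured facts pulled from a model's answer (extraction step assumed).
model_claim = {"employment": "part_time", "leave_days": 20}

for f in check_response(model_claim, RULES):
    print(f"{f.rule}: model said {f.claimed}, ground truth implies {f.expected}")
```

Each finding carries both the claimed and the expected value, mirroring the idea of showing the correct answer next to the likely mistake.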

AWS says PwC is already using Automated Reasoning checks to design AI assistants for its clients. And Swami Sivasubramanian, vice president of AI and data at AWS, suggested that this kind of tooling is exactly what is attracting customers to Bedrock.

“With the introduction of these new capabilities, we are innovating on behalf of our customers to solve some of the biggest challenges facing the entire industry when moving generative AI applications into production,” he said in a statement. Bedrock's customer base has grown 4.7 times in the last year to tens of thousands of customers, Sivasubramanian added.

But as one expert told me this summer, trying to eliminate hallucinations from generative AI is like trying to eliminate hydrogen from water.

AI models hallucinate because they don't actually “know” anything. They are statistical systems that identify patterns in a set of data and predict which data comes next based on previously seen examples. This means that a model's response is not an answer but a prediction, within a margin of error, of how the question should be answered.
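A toy example makes the point concrete. The sketch below is a deliberately tiny bigram model, far simpler than any production system, and the four-sentence corpus is invented for illustration. It predicts the next word purely from how often words followed one another in its training text.

```python
from collections import Counter, defaultdict


def train_bigrams(corpus: list[str]) -> dict[str, Counter]:
    """Count, for every word, which words followed it in the corpus."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts


def predict_next(counts, word):
    """Return the most frequent follower and its estimated probability."""
    followers = counts.get(word)
    if not followers:
        return None
    nxt, n = followers.most_common(1)[0]
    return nxt, n / sum(followers.values())


corpus = [
    "the capital of france is paris",
    "the capital of italy is rome",
    "the capital of spain is madrid",
    "the city of light is paris",
]
counts = train_bigrams(corpus)

# The model only sees the previous word, so after "is" it picks the most
# common follower regardless of which country the question was about.
print(predict_next(counts, "is"))
```

Asked to complete “the capital of italy is”, this model would say “paris”, because that word followed “is” most often. The output is fluent and statistically plausible, yet wrong, which is the essence of a hallucination.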

AWS claims that Automated Reasoning checks uses “logically accurate” and “verifiable reasoning” to reach its conclusions. But the company volunteered no data showing that the tool is reliable.

In other Bedrock news, AWS announced Model Distillation this morning. It is a tool for transferring the capabilities of a larger model (such as Llama 405B) to a smaller model (such as Llama 8B) that is cheaper and faster to run. Model Distillation, AWS's answer to Microsoft's Distillation in Azure AI Foundry, provides a way to experiment with different models without breaking the bank, AWS says.

The foundation of AWS re:Invent 2024 (Image credit: Frederic Lardinois/TechCrunch)

“After a customer provides sample prompts, Amazon Bedrock does all the work to generate responses and fine-tune the smaller model,” AWS explained in a blog post.
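The first half of that workflow, pairing sample prompts with a larger model's responses to build training data for a smaller one, might look roughly like the sketch below. It is an assumption-laden illustration rather than Bedrock's actual pipeline: `toy_teacher` is a stand-in for a call to a large hosted model, the function names and record fields are hypothetical, and the fine-tuning step itself is left out.

```python
import json
from typing import Callable


def build_distillation_set(prompts: list[str],
                           teacher: Callable[[str], str]) -> list[dict]:
    """Pair each sample prompt with the teacher model's response."""
    return [{"prompt": p, "completion": teacher(p)} for p in prompts]


def to_jsonl(records: list[dict]) -> str:
    """Serialize records one JSON object per line, a common training format."""
    return "\n".join(json.dumps(r) for r in records)


def toy_teacher(prompt: str) -> str:
    # Stand-in for a large model; purely illustrative.
    return f"[teacher answer to: {prompt}]"


samples = ["Summarize our refund policy.", "Draft a welcome email."]
training_set = build_distillation_set(samples, toy_teacher)
print(to_jsonl(training_set))
```

The smaller model would then be fine-tuned on these prompt and completion pairs so that it imitates the larger model on the customer's own kind of task.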

However, there are some caveats.

At this time, Model Distillation only works with Bedrock-hosted models from Anthropic and Meta. Customers must choose a large and a small model from the same model “family”; the two cannot come from different providers. And distilled models lose some accuracy: “less than 2%,” AWS claims.

If none of that deters you, Model Distillation is now available in preview, along with Automated Reasoning checks.

“Multi-agent collaboration” is also available in preview. This is a new Bedrock feature that lets customers assign AIs to subtasks within a larger project. Part of Bedrock Agents, AWS's contribution to the AI agent craze, multi-agent collaboration provides tools to create and tune AIs for tasks such as reviewing financial records and assessing global trends.

Customers can also designate a “supervisor agent” to split up tasks and route them to the AIs automatically. The supervisor can “[give] specific agents access to the information they need to complete their work,” AWS says, and “[determine] which actions can be processed in parallel and which need details from other tasks before [an] agent can move forward.”

“Once all of the specialized [AIs] complete their inputs, the supervisor agent [can pull] the information together [and] synthesize the results,” AWS wrote in the post.
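The supervisor pattern AWS describes can be approximated with a short scheduling loop. What follows is a minimal sketch under stated assumptions, not the Bedrock Agents API: the `run_supervisor` function, the task names, and the lambda “agents” are hypothetical placeholders for real model calls.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

# Each task maps a name to (agent function, names of tasks it depends on).
Task = tuple[Callable[[dict], str], list[str]]


def run_supervisor(tasks: dict[str, Task]) -> dict[str, str]:
    """Run tasks in dependency order, parallelizing those that are ready."""
    results: dict[str, str] = {}
    pending = dict(tasks)
    with ThreadPoolExecutor() as pool:
        while pending:
            # A task is ready once every task it depends on has a result.
            ready = [name for name, (_, deps) in pending.items()
                     if all(d in results for d in deps)]
            if not ready:
                raise ValueError("cyclic or unsatisfiable dependencies")
            # Give each agent only the results it depends on.
            futures = {
                name: pool.submit(pending[name][0],
                                  {d: results[d] for d in pending[name][1]})
                for name in ready
            }
            for name, future in futures.items():
                results[name] = future.result()
                del pending[name]
    return results


tasks = {
    "records": (lambda ctx: "records reviewed", []),
    "trends": (lambda ctx: "trends assessed", []),
    # The summary waits for both specialists, then synthesizes their output.
    "summary": (lambda ctx: " + ".join(ctx[k] for k in sorted(ctx)),
                ["records", "trends"]),
}
results = run_supervisor(tasks)
print(results["summary"])
```

Here the two independent tasks run in parallel in the first round, and the summary task runs only after both have finished, receiving just their results as its context.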

It sounds impressive. But as with all of these features, we will have to see how well they perform when deployed in the real world.


