TechBrunch

Anthropic claims its latest model is best in class

By TechBrunch | June 20, 2024 | 7 Mins Read


OpenAI rival Anthropic is releasing a powerful new generative AI model called Claude 3.5 Sonnet, but it's more of an incremental step than a breakthrough.

Claude 3.5 Sonnet can analyze both text and images, and it can generate text. At least on paper, it is Anthropic's best model to date. Across several AI benchmarks for reading, coding, math and vision, Claude 3.5 Sonnet outperforms its predecessor, Claude 3 Sonnet, and also beats Anthropic's previous flagship model, Claude 3 Opus.

Benchmarks aren't necessarily the most useful measure of AI progress, in part because many of them test esoteric edge cases, like answering health questions, that don't apply to the average person. But for reference, Claude 3.5 Sonnet narrowly beat leading rival models, including OpenAI's recently released GPT-4o, on several of the benchmarks Anthropic tested it against.

Alongside the new model, Anthropic is releasing something it's calling Artifacts, a workspace where users can edit and add to content (such as code and documents) generated by Anthropic's models. Currently in preview, Artifacts will get new features in the near future, such as the ability to collaborate with larger teams and store knowledge bases, Anthropic said.

Focus on efficiency

Claude 3.5 Sonnet performs a bit better than Claude 3 Opus, and Anthropic says the model is better at understanding nuanced and complex instructions, along with concepts like humor. (AI is notoriously unfunny, though.) But perhaps more importantly for developers building Claude-powered apps that require quick responses (e.g., customer service chatbots), 3.5 Sonnet is faster: Anthropic says it is about twice as fast as 3 Opus.

According to Anthropic, vision (analyzing photos) is one area where Claude 3.5 Sonnet shows significant improvements over 3 Opus: 3.5 Sonnet can interpret charts and graphs more accurately and transcribe text from “imperfect” images, such as photos with distortions and visual artifacts.

Michael Gerstenhaber, product lead at Anthropic, says these improvements are the result of architecture tweaks and new training data, including AI-generated data. Which data, specifically? Gerstenhaber wouldn't say, but he hinted that much of Claude 3.5 Sonnet's strength comes from these training sets.

[Image: Anthropic Claude 3.5 Sonnet. Image credit: Anthropic]

“What's important to [businesses] is not whether the AI is competitive on benchmarks, but whether the AI is helping them meet their business needs,” Gerstenhaber told TechCrunch. “And from that standpoint, we believe Claude 3.5 Sonnet will be a step ahead of anything else we offer, and ahead of anything else in the industry.”

Keeping the training data secret may be for competitive reasons, but it may also be to protect Anthropic from legal challenges, particularly those related to fair use: Courts have yet to decide whether vendors like Anthropic and competitors like OpenAI, Google, and Amazon have the right to train on public data, including copyrighted data, without paying or crediting the creators of that data.

So what we know is that Claude 3.5 Sonnet, like Anthropic's previous models, was trained on large amounts of text and images, plus feedback from human testers to help “tune” the model to users' intent and keep it from spitting out harmful or otherwise problematic text.

[Image: Anthropic Claude 3.5 Sonnet. Image credit: Anthropic]

What else do we know? Claude 3.5 Sonnet's context window (the amount of text the model can analyze before generating new text) is 200,000 tokens, the same as 3 Sonnet. Tokens are bits of raw data, like the syllables “fan,” “tas,” and “tic” in the word “fantastic.” 200,000 tokens equals roughly 150,000 words.
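
As a rough illustration of the arithmetic above, the sketch below converts a token budget into an approximate word count using the article's own ratio (200,000 tokens to roughly 150,000 words, or about 0.75 words per token). The function names and the constant are assumptions made for illustration. Real tokenizers vary with language and content, so treat this as a back-of-the-envelope estimate only.

```python
# Ratio implied by the article: 150,000 words per 200,000 tokens.
# This is an approximation, not a property of any specific tokenizer.
WORDS_PER_TOKEN = 150_000 / 200_000  # 0.75

# Context window of Claude 3.5 Sonnet, as stated in the article.
CONTEXT_WINDOW_TOKENS = 200_000


def approx_words(tokens: int, words_per_token: float = WORDS_PER_TOKEN) -> int:
    """Estimate how many English words correspond to a number of tokens."""
    return round(tokens * words_per_token)


def fits_in_context(word_count: int, window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Estimate whether a text of `word_count` words fits in the window."""
    estimated_tokens = word_count / WORDS_PER_TOKEN
    return estimated_tokens <= window
```

Under this estimate, a 100,000-word manuscript would fit in the window, while a 200,000-word one would not.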

Claude 3.5 Sonnet is available starting today, free of charge, to users of Anthropic's web client and the Claude iOS app. Subscribers to Anthropic's paid plans, Claude Pro and Claude Team, get 5x higher rate limits. 3.5 Sonnet is also available via Anthropic's API and on managed platforms such as Amazon Bedrock and Google Cloud's Vertex AI.

“Claude 3.5 Sonnet delivers a major boost in intelligence without sacrificing speed and sets the foundation for future releases across the Claude model family,” said Gerstenhaber.

Claude 3.5 Sonnet also powers Artifacts, which opens a dedicated window in the Claude web client when a user asks the model to generate content such as a code snippet, a text document, or a website design. Gerstenhaber explains: “An artifact is a model output that lets you set aside generated content and iterate on it. For example, if you want to generate code, the artifact is placed in the UI, and you can then interact with Claude to iterate over the document and improve it so that the code can be executed.”

Overall picture

So what is the significance of Claude 3.5 Sonnet in the broader context of Anthropic and the AI ecosystem?

Claude 3.5 Sonnet shows that, absent major research breakthroughs, we can only expect incremental progress on the model front for now. Over the past few months, there have been flagship releases from Google (Gemini 1.5 Pro) and OpenAI (GPT-4o) that have made modest advances in terms of benchmarks and qualitative performance. However, the rigidity of today's model architectures and the massive compute required to train them mean that we are unlikely to see a leap comparable to the one from GPT-3 to GPT-4 anytime soon.

There are signs that investors are becoming wary of generative AI's longer-than-expected path to ROI, as generative AI vendors turn to data curation and licensing instead of promising new scalable architectures. Anthropic is somewhat insulated from this pressure because it's in the enviable position of being Amazon's (and, to a lesser extent, Google's) insurance against OpenAI. But the company's revenue is projected to reach just under $1 billion by the end of 2024, a fraction of OpenAI's. And Anthropic's backers won't let the company forget that fact.

Despite a growing client base that includes well-known brands like Bridgewater, Brave, Slack, and DuckDuckGo, Anthropic still lacks a certain name recognition with enterprises. It's telling that PwC recently partnered with OpenAI, not Anthropic, to resell its generative AI products to enterprises.

So Anthropic is taking a strategic, well-trodden approach, investing development time into products like Claude 3.5 Sonnet to deliver slightly better performance at commodity prices. 3.5 Sonnet is priced the same as 3 Sonnet: $3 per million tokens fed into the model, and $15 per million tokens generated by the model.
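
To make the pricing concrete, here is a minimal sketch that estimates the cost of a single request from the two rates quoted above ($3 per million input tokens, $15 per million output tokens). The function and constant names are hypothetical and not part of any Anthropic SDK, and the sketch ignores anything beyond the two per-token rates the article mentions.

```python
# Rates quoted in the article for Claude 3.5 Sonnet (USD per million tokens).
INPUT_USD_PER_MILLION = 3.00    # tokens fed into the model
OUTPUT_USD_PER_MILLION = 15.00  # tokens generated by the model


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one request at the quoted rates."""
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost
```

For example, a request with 10,000 input tokens and 1,000 output tokens would cost about $0.045 at these rates, with a third of that coming from the much smaller output.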

Gerstenhaber spoke about this in our conversation: “When building an application, the end user doesn't need to know what model is being used or how the engineers have optimized the experience for the user,” he said, “but the engineers have the tools to optimize that experience along whatever vector they need to optimize, and cost is definitely one of those vectors.”

Claude 3.5 Sonnet won't solve the hallucination problem. It will certainly get things wrong. But it might be attractive enough to entice developers and companies to switch to Anthropic's platform. That's what Anthropic cares about, after all.

Toward the same end, Anthropic has been focusing on tooling: its experimental steering AI, which allows developers to “steer” the inner workings of its models; integrations that allow its models to take actions within apps; and tools built on top of its models, like the aforementioned Artifacts experience. It also hired an Instagram co-founder as its head of product. And it has expanded the availability of its products, recently bringing Claude to Europe and setting up offices in London and Dublin.

Ultimately, Anthropic seems to have come to the realization that as the feature gap between models narrows, building an ecosystem around models, rather than building models in isolation, will be the key to retaining customers.

Still, Gerstenhaber insisted that larger, more capable models like Claude 3.5 Opus, along with features like web search and preference memory, are on the way.

“I haven't heard of deep learning hitting a wall yet, and I'll leave it to researchers to speculate about where that wall might be, but I think it's too early to draw any conclusions, especially given the pace of innovation,” he said. “There's been very rapid development and rapid innovation, and I have no reason to believe that's going to slow down.”

We'll have to wait and see.


