Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

One of Elon Musk's longtime VCS is suing his former employer after allegedly fired

May 8, 2025

Korean telephone giant SKT data breaches timeline

May 8, 2025

AppFigures: Apple earned more than $10 billion from its US App Store commission last year

May 8, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    AppFigures: Apple earned more than $10 billion from its US App Store commission last year

    May 8, 2025

    Instagram thread gets video ads

    May 8, 2025

    Google deploys AI tools to protect Chrome users from fraud

    May 8, 2025

    Match to lay off 13% of staff

    May 8, 2025

    Apple tries to delay ruling that it will prohibit cutting payments for external apps

    May 8, 2025
  • Crypto

    Stripe unveils AI Foundation model for payments, revealing a “deeper partnership” with Nvidia

    May 7, 2025

    Movie Pass explores the daily fantasy platform of film buffs

    May 1, 2025

    Speaking on TechCrunch 2025: Application is open

    April 24, 2025

    Revolut, a $45 billion Neobank, recorded a profit of $1 billion in 2024

    April 24, 2025

    The new kids show will come with a crypto wallet when it debuts this fall

    April 18, 2025
  • Security

    Korean telephone giant SKT data breaches timeline

    May 8, 2025

    Powerschool paid the hacker ransom, but now the school says it's being forced

    May 8, 2025

    VC Company Insight Partners Review Personal Data Stolen During a January Hack

    May 8, 2025

    Crowdstrike says it will fire 500 workers

    May 7, 2025

    Ox Security lands fresh $60 million to scan code vulnerabilities

    May 7, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    One of Elon Musk's longtime VCS is suing his former employer after allegedly fired

    May 8, 2025

    Sequoia leads a $1.5 billion tender offer for sales automation startup clay

    May 8, 2025

    Bosch Ventures is turning attention to North America with a new $270 million fund

    May 8, 2025

    A comprehensive list of 2025 tech layoffs

    May 7, 2025

    Kapor Capital's managing partner Ulili Onovakpuri has left the company

    May 7, 2025
TechBrunchTechBrunch

Anthropique claims its latest model is best in class

TechBrunchBy TechBrunchJune 20, 20247 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


OpenAI rival Anthropic is releasing a powerful new generative AI model called Claude 3.5 Sonnet, but it's more of an incremental step than a breakthrough.

Claude 3.5 Sonnet can not only analyze both text and images, but also generate text. At least in theory, it is Anthropic's best model to date. Across several AI benchmarks for reading, coding, math and vision, Claude 3.5 Sonnet outperforms its successor, Claude 3 Sonnet, and outperforms Anthropic's previous flagship model, Claude 3 Opus.

Benchmarks aren't necessarily the most useful measure of AI progress, in part because many of them test esoteric edge cases that don't apply to the average person, like answering health questions, but for reference, Claude 3.5 Sonnet barely beat leading rival models, including OpenAI's recently released GPT-4o, in several benchmarks tested by Anthropic.

Alongside the new models, Anthropic is releasing something it's calling Artifacts, a workspace where users can edit and add content (such as code and documentation) generated by Anthropic's models. Currently in preview, Artifacts will get new features in the near future, such as the ability to collaborate with larger teams and store knowledge bases, Anthropic said.

Focus on efficiency

Claude 3.5 Sonnet performs a bit better than Claude 3 Opus, and Anthropic says the model is better at understanding nuanced and complex instructions, along with concepts like humor. (The AI ​​is notoriously not funny, though.) But perhaps more importantly for developers building Claude-powered apps that require quick responses (e.g., customer service chatbots), 3.5 Sonnet is faster, which Anthropic says is about twice as fast as 3 Opus.

According to Anthropic, vision (analysing photographs) is one area where Claude 3.5 Sonnet shows significant improvements over 3 Opus: 3.5 Sonnet is able to interpret charts and graphs more accurately and transcribe text from “imperfect” images, such as photographs with distortions and visual artifacts.

Michael Gerstenhaber, product lead at Anthropic, says these improvements are the result of architecture tweaks and new training data, including AI-generated data. Which data, specifically? Gerstenhaber wouldn't say, but he hinted that much of Claude 3.5 Sonnet's strength comes from these training sets.

Anthropic Claude 3.5 SonnetImage credit: Anthropic

“What's important is [businesses] “It's not about whether the AI ​​is competitive on benchmarks, it's about whether the AI ​​is helping you meet your business needs,” Gerstenhaber told TechCrunch. “And from that standpoint, we believe Claude 3.5 Sonnet will be a product that puts us a step ahead of anything else we offer, and we believe it will be a product that puts us ahead of anything else in the industry.”

Keeping the training data secret may be for competitive reasons, but it may also be to protect Anthropic from legal challenges, particularly those related to fair use: Courts have yet to decide whether vendors like Anthropic and competitors like OpenAI, Google, and Amazon have the right to train on public data, including copyrighted data, without paying or crediting the creators of that data.

So what we know is that Claude 3.5 Sonnet, like Anthropic's previous models, will be trained on large amounts of text and images, plus feedback from human testers to help the model be “tuned” to user intent and avoid spitting out harmful or problematic text.

Anthropic Claude 3.5 SonnetImage credit: Anthropic

What else do we know? Claude 3.5 Sonnet's context window (the amount of text the model can analyze before generating new text) is 200,000 tokens, the same as 3 Sonnet. Tokens are bits of raw data, like the syllables “fan,” “tas,” and “tic” in the word “fantastic.” 200,000 tokens equals roughly 150,000 words.

Claude 3.5 Sonnet is available starting today and is free for free users of Anthropic's web client and Claude iOS app. Subscribers to Anthropic's paid plans, Claude Pro and Claude Team, are subject to 5x rate limits. 3.5 Sonnet is also available via Anthropic's API and on managed platforms such as Amazon Bedrock and Google Cloud's Vertex AI.

“Claude 3.5 Sonnet delivers a major boost in intelligence without sacrificing speed and sets the foundation for future releases across the Claude model family,” said Gerstenhaber.

Claude 3.5 Sonnet also drives artifacts, which pop up dedicated windows in the Claude Web client when a user asks a model to generate content such as a code snippet, a text document, or a website design. Gerstenhaber explains: “An artifact is a model output that lets you set aside generated content and iterate on it. For example, if you want to generate code, the artifact is placed in the UI, and you can then interact with Claude to iterate over the document and improve it so that the code can be executed.”

Overall picture

So what is the significance of Claude 3.5 Sonnet in the broader context of anthropology and the AI ​​ecosystem?

Claude 3.5 Sonnet shows that, absent major research breakthroughs, we can only expect incremental progress on the model front for now. Over the past few months, there have been flagship releases from Google (Gemini 1.5 Pro) and OpenAI (GPT-4o) that have made modest advances in terms of benchmarks and qualitative performance. However, the robustness of today's model architectures and the massive compute required for training mean that we will not see a leap comparable to the one from GPT-3 to GPT-4 for a long time.

There are signs that investors are becoming wary of generative AI's longer-than-expected path to ROI, as generative AI vendors turn to data curation and licensing instead of promising new scalable architectures. Anthropic is somewhat insulated from this pressure because it's in the enviable position of being insurance against Amazon (and, to a lesser extent, Google) OpenAI. But the company's revenue is projected to reach just under $1 billion by the end of 2024, a fraction of OpenAI's. And Anthropic's backers won't let the company forget that fact.

Despite a growing client base that includes well-known brands like Bridgewater, Brave, Slack, and DuckDuckGo, Anthropic still lacks a certain name recognition with enterprises. It's telling that PwC recently partnered with OpenAI, not Anthropic, to resell its generative AI products to enterprises.

So Anthropic is moving in with a strategic and well-known approach, investing development time into products like Claude 3.5 Sonnet to achieve slightly better performance at commodity prices. 3.5 Sonnet is priced the same as 3 Sonnet: $3 per million tokens fed into the model, and $15 per million tokens generated by the model.

Gerstenhaber spoke about this in our conversation: “When building an application, the end user doesn't need to know what model is being used or how the engineers have optimized the experience for the user,” he said, “but the engineers have the tools to optimize that experience along whatever vector they need to optimize, and cost is definitely one of those vectors.”

Claude 3.5 Sonnet won't solve the hallucination problem. It will definitely get it wrong. But it might be attractive enough to entice developers and companies to switch to Anthropic's platform. That's what Anthropic cares about, after all.

Towards the same end, Anthropic has been focusing on tools like its experimental Steering AI, which allows developers to “steer” the inner workings of models, integrations that allow models to take actions within apps, and tools built on top of models, like the aforementioned Artifacts experience. It also hired an Instagram co-founder as its head of product. Additionally, it has expanded its product offering, recently bringing Claude into Europe, setting up offices in London and Dublin.

Ultimately, Anthropic seems to have come to the realization that as the feature gap between models narrows, building an ecosystem around models, rather than building models in isolation, will be the key to retaining customers.

Still, Gerstenhaber insisted that larger, more capable models like the Claude 3.5 Opus, with features like web search and preference memory, are on the way.

“I haven't heard of deep learning hitting a wall yet, and I'll leave it to researchers to speculate about where that wall might be, but I think it's too early to draw any conclusions, especially given the pace of innovation,” he said. “There's been very rapid development and rapid innovation, and I have no reason to believe that's going to slow down.”

Let's take a look.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

OpenAI seeks to extend human lifespans with the help of longevity startups

January 17, 2025

Farewell to the $200 million woolly mammoth and TikTok

January 17, 2025

Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

January 17, 2025

Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

January 16, 2025

Apple suspends AI notification summaries for news after generating false alerts

January 16, 2025

Nvidia releases more tools and guardrails to help enterprises adopt AI agents

January 16, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

One of Elon Musk's longtime VCS is suing his former employer after allegedly fired

May 8, 2025

Korean telephone giant SKT data breaches timeline

May 8, 2025

AppFigures: Apple earned more than $10 billion from its US App Store commission last year

May 8, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.