Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Robinhood expands its footprint in Canada by getting Wonderfi

May 13, 2025

Marks & Spencer confirms that customer's personal data has been stolen in a hack

May 13, 2025

Alltrails debuts a $80 annual membership, including smart routes with AI

May 12, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    Alltrails debuts a $80 annual membership, including smart routes with AI

    May 12, 2025

    Apple brings emergency satellite functionality to iPhone 13 with iOS 18.5

    May 12, 2025

    A flock of Whitney Wolf burns out – and bounces back

    May 10, 2025

    Google I/O 2025: What to expect including Gemini and Android 16 updates?

    May 9, 2025

    Epic Games and Spotify Test Apple's new app store rules

    May 9, 2025
  • Crypto

    Robinhood expands its footprint in Canada by getting Wonderfi

    May 13, 2025

    Stripe unveils AI Foundation model for payments, revealing a “deeper partnership” with Nvidia

    May 7, 2025

    Movie Pass explores the daily fantasy platform of film buffs

    May 1, 2025

    Speaking on TechCrunch 2025: Application is open

    April 24, 2025

    Revolut, a $45 billion Neobank, recorded a profit of $1 billion in 2024

    April 24, 2025
  • Security

    Marks & Spencer confirms that customer's personal data has been stolen in a hack

    May 13, 2025

    Five Things We Learned from WhatsApp vs. NSO Group Spyware Litigation

    May 10, 2025

    FBI and Dutch police seize and shut down hacked router botnets

    May 9, 2025

    Florida bill calling for encryption backdoors for social media accounts failed

    May 9, 2025

    Korean telephone giant SKT data breaches timeline

    May 8, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    Even the A16Z VC says no one really knows what an AI agent is

    May 12, 2025

    Mercury CEO formalizes bets on early stage founders with a $26 million fund

    May 12, 2025

    Google has launched a new initiative to help startups build AI

    May 12, 2025

    Saudi Arabian Prince launches AI ventures when Trump, Musk, Altman and Zuckerberg arrive at the meeting

    May 12, 2025

    This American VC is betting on European defence technology. That's still very rare

    May 12, 2025
TechBrunchTechBrunch

Anthropic claims new model outperforms GPT-4

TechBrunchBy TechBrunchMarch 4, 20246 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


Anthropic, an AI startup backed by hundreds of millions in venture capital (and likely hundreds of millions more soon), today announced Claude, the latest version of its GenAI technology. And the company claims it's comparable to OpenAI's GPT-4 in terms of performance.

Anthropic's new Claude 3, called GenAI, is a family of models: Claude 3 Haiku, Claude 3 Sonnet, and Claude 3 Opus, with Opus being the most powerful. Anthropic claims that all offer “improved capabilities” in analytics and prediction, including GPT-4 (but not GPT-4 Turbo) and Google's Gemini 1.0 Ultra (but not Gemini 1.5 Pro). It shows improved performance on certain benchmarks compared to the model.

Notably, Claude 3 is Anthropic's first multimodal GenAI, meaning it can analyze text as well as images, similar to GPT-4 and some flavors of Gemini. Claude 3 can process photos, charts, graphs, technical diagrams, and draw from PDFs, slideshows, and other types of documents.

As a step up over some GenAI rivals, Claude 3 can analyze multiple images (up to 20) in a single request. This allows comparison and contrast of images, he says Anthropic.

However, Claude 3's image processing has its limits.

Anthropic makes the model unable to identify people. There is no doubt that you are wary of the ethical and legal implications. And the company claims that the Claude 3 is prone to making mistakes with “low-quality” images (less than 200 pixels), and is unable to provide accurate information for spatial reasoning (such as reading an analog clock face) or object counting. ) admits to struggling with tasks that involve number of objects in the image).

Humanity Claude 3

Image credit: Anthropic

Claude 3 also does not produce any artwork. At least for now, the model is strictly image analysis.

Anthropic said customers generally find that Claude 3 follows multi-step instructions better than previous versions, produces structured output in formats such as JSON, and is easier to use in languages ​​other than English. He says he can look forward to a conversation. Anthropic says Claude 3 should also refuse to answer questions less often, thanks to a “more nuanced understanding of requests.” And soon, Claude 3 will be able to cite the source of the answer to the question so that users can verify the question.

“Claude 3 tends to produce more expressive and engaging responses,” Anthropic writes in a support article. “[It’s] Easy to prompt and maneuver compared to traditional models. Users will find that they can achieve the desired results by using shorter, more concise prompts. ”

Some of these improvements come from Claude 3's expanded context.

A model's context, or context window, refers to the input data (such as text) that the model considers before producing output. Models with small context windows tend to “forget” even the most recent conversations, often going off topic and in problematic ways. As an added benefit, large-scale context models can better understand the narrative flow of the data they ingest and generate more context-rich responses (at least hypothetically).

According to Anthropic, Claude 3 will initially support a 200,000-token context window, which is approximately 150,000 words, and some customers will support a 1 million-token context window (approximately 700,000 words). This is on par with Google's latest GenAI model, the aforementioned Gemini 1.5 Pro, which also offers up to 1 million context windows.

Just because Claude 3 is an upgrade over its predecessor doesn't mean it's perfect.

In its technical white paper, Anthropic acknowledges that Claude 3 is not immune to the problems that plague other GenAI models: bias and hallucinations (i.e., hoaxes). Unlike some of his GenAI models, Claude 3 cannot search his web. The model can only answer questions using data before August 2023. Also, while Claude is multilingual, he is not as fluent in certain “low resource” languages ​​as he is in English.

However, Anthropic is promising frequent updates to Claude 3 in the coming months.

“We don't think we are nearing the limits of model intelligence. [enhancements] We will be making it available to the Claude 3 model family in the coming months,” the company wrote in a blog post.

Opus and Sonnet are currently available on the web, via Anthropic's development console and API, Amazon's Bedrock platform, and Google's Vertex AI. Haiku will also follow later this year.

The breakdown of charges is as follows:

Opus: $15 per million input tokens, $75 per million output tokens Sonnet: $3 per million input tokens, $15 per million output tokens Haiku: $0.25 per million input tokens, $1.25 per million output tokens dollar

That's Claude 3. But what’s the view at 30,000 feet?

Now, as we previously reported, Anthropic's ambition is to create “the next generation of algorithms for AI self-learning.” Such algorithms can be used to build virtual assistants that can reply to emails, perform research, generate art, books, etc. Some of these are already being trialled, such as GPT-4 and other large-scale language models.

Anthropic teased this in the aforementioned blog post, saying that Claude 3 will have enhanced out-of-the-box features, including the ability to interact with other systems, interactive coding, and “more advanced agent features.” They say they plan to add features to Claude 3. ”

This last part is a form of software agent that automates complex tasks, such as transferring data from documents to spreadsheets for analysis or automatically filling out expense reports and inputting them into accounting software. Reminds me of OpenAI's ambition to build. OpenAI already offers an API that allows developers to build “agent-like experiences” into their apps, and Anthropic seems keen to provide similar functionality.

Will we see an Anthropic image generator next? Frankly, that would surprise me. Image generators have been the subject of a lot of controversy lately, mainly for reasons related to copyright and prejudice. Google was recently forced to disable its image generator for injecting diversity into photos with a farcical disregard for historical context, and many image generator vendors have profited from their work by training GenAI. He is engaged in a legal battle with an artist who accuses him of cheating. without providing any credit or compensation.

I'm interested in the evolution of Anthropic's technology “Constitutional AI” for training GenAI. The company claims this makes it easier to understand the model's behavior and make adjustments if necessary. Constitutional AI aims to provide a way to tailor AI to human intent, with models responding to questions and performing tasks using a simple set of guidelines. For Claude 3, for example, Anthropic said that based on customer feedback, it added constitutional principles that direct the model to be understanding and accessible to people with disabilities.

Whatever Anthropic's end goal is, it's for the long haul. According to pitch materials leaked last May, the company is aiming to raise up to $5 billion over the next year or so, which is just the baseline needed to remain competitive with OpenAI. It may just be. (Training models isn't cheap, after all.) With $2 billion and his $4 billion in capital pledged from Google and Amazon respectively, the plan is well underway.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025

Startup Weekly: Wiz bets paid off at M&A Rich Week

March 21, 2025

Wayve CEO shares his key elements for scaling autonomous driving technology

March 21, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

Robinhood expands its footprint in Canada by getting Wonderfi

May 13, 2025

Marks & Spencer confirms that customer's personal data has been stolen in a hack

May 13, 2025

Alltrails debuts a $80 annual membership, including smart routes with AI

May 12, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.