Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Apple brings emergency satellite functionality to iPhone 13 with iOS 18.5

May 12, 2025

Even the A16Z VC says no one really knows what an AI agent is

May 12, 2025

Mercury CEO formalizes bets on early stage founders with a $26 million fund

May 12, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    Apple brings emergency satellite functionality to iPhone 13 with iOS 18.5

    May 12, 2025

    A flock of Whitney Wolf burns out – and bounces back

    May 10, 2025

    Google I/O 2025: What to expect including Gemini and Android 16 updates?

    May 9, 2025

    Epic Games and Spotify Test Apple's new app store rules

    May 9, 2025

    X Timeline is not updated for many users

    May 9, 2025
  • Crypto

    Stripe unveils AI Foundation model for payments, revealing a “deeper partnership” with Nvidia

    May 7, 2025

    Movie Pass explores the daily fantasy platform of film buffs

    May 1, 2025

    Speaking on TechCrunch 2025: Application is open

    April 24, 2025

    Revolut, a $45 billion Neobank, recorded a profit of $1 billion in 2024

    April 24, 2025

    The new kids show will come with a crypto wallet when it debuts this fall

    April 18, 2025
  • Security

    Five Things We Learned from WhatsApp vs. NSO Group Spyware Litigation

    May 10, 2025

    FBI and Dutch police seize and shut down hacked router botnets

    May 9, 2025

    Florida bill calling for encryption backdoors for social media accounts failed

    May 9, 2025

    Korean telephone giant SKT data breaches timeline

    May 8, 2025

    Powerschool paid the hacker ransom, but now the school says it's being forced

    May 8, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    Even the A16Z VC says no one really knows what an AI agent is

    May 12, 2025

    Mercury CEO formalizes bets on early stage founders with a $26 million fund

    May 12, 2025

    Google has launched a new initiative to help startups build AI

    May 12, 2025

    Saudi Arabian Prince launches AI ventures when Trump, Musk, Altman and Zuckerberg arrive at the meeting

    May 12, 2025

    This American VC is betting on European defence technology. That's still very rare

    May 12, 2025
TechBrunchTechBrunch

Databricks spent $10 million on new DBRX-generated AI model, but it couldn't beat GPT-4

TechBrunchBy TechBrunchMarch 27, 20246 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


If you wanted to raise the profile of a major technology company and you had $10 million to spend, what would you do with it? In a Super Bowl ad? F1 sponsor?

You can also spend it training generative AI models. Although not marketing in the traditional sense, generative models are attention-grabbing and increasingly focused on a vendor's fundamental products and services.

Check out Databricks' DBRX, a new generative AI model similar to OpenAI's GPT series and Google's Gemini, announced today. Available for research and commercial use on GitHub and the AI ​​development platform Hugging Face, DBRX Base and fine-tuned versions (DBRX Instruct) can be run and tuned on public, custom, or your own data. Masu.

“DBRX is useful and trained to provide information on a variety of topics,” Naveen Rao, Databricks' vice president of generative AI, told TechCrunch in an interview. “DBRX is optimized and tailored for English usage, but can speak and translate in a variety of languages, including French, Spanish, and German.”

Databricks describes DBRX as “open source,” similar to “open source” models like Meta's Llama 2 and AI startup Mistral's models. (Whether these models truly meet the definition of open source is a subject of intense debate.)

Databricks says it spent about $10 million and eight months training DBRX, and claims (quoted from a press release) that it is “outperforming.”[s] Evaluate all existing open source models against standard benchmarks. ”

But here's the marketing issue: Unless you're a Databricks customer, DBRX is very difficult to use.

This is because running DBRX in a standard configuration requires a server or PC with at least four Nvidia H100 GPUs. The H100 costs a few thousand dollars apiece, probably more. While this may be a big change for the average company, it's out of reach for many developers and solopreneurs.

There are also some small details. Databricks says companies with more than 700 million active users will face “certain limitations” comparable to Llama 2's Meta, ensuring all users use DBRX “responsibly” states that you must agree to the terms and conditions. (Databricks did not voluntarily provide details of these terms at the time of publication.)

Databricks offers the Mosaic AI Foundation Model product as a managed solution to these obstacles. In addition to running DBRX and other models, it provides a training stack for fine-tuning DBRX with custom data. Rao suggested that customers can use his Databricks Model Serving offering to privately host his DBRX or work with Databricks to deploy his DBRX on the hardware of their choice. did.

Rao added:

Ultimately, the benefit for Databricks is more users on the platform, as we are focused on making the Databricks platform the best choice for building customized models. DBRX demonstrates a best-in-class pre-training and tuning platform that allows customers to build their own models from scratch. This is an easy way for customers to start using his Databricks Mosaic AI-generated AI tools. DBRX is also highly capable out of the box and can be tuned to perform better on specific tasks with better economics than larger closed models.

Databricks claims that DBRX runs up to 2x faster than Llama 2 thanks to its Mix of Experts (MoE) architecture. MoE, which DBRX shares with Llama 2, Mistral's new model, and Google's recently announced Gemini 1.5 Pro, essentially splits a data processing task into multiple subtasks and divides these subtasks into smaller, specialized Delegate to the “expert” model.

Most MoE models have eight experts. DBRX has 16, which improves quality, he says Databricks.

However, quality is relative.

Databricks claims that DBRX outperforms Llama 2 and Mistral models in certain language understanding, programming, math and logic benchmarks, but in most areas outside of niche use cases such as database programming. DBRX falls short of OpenAI's GPT-4, which is probably the leading generative AI model. Language production.

Rao acknowledges that DBRX has other limitations. This means that, like all generative AI models, despite Databricks' commitment to safety testing and red teaming, they can fall victim to “illusory” answers to queries. . Because the model is simply trained to associate words and phrases with specific concepts, its responses are not necessarily accurate if those associations are not completely accurate.

DBRX is also not multimodal, unlike recent flagship generative AI models such as Gemini. (It can only process and generate text, not images.) Also, we don't know exactly what data sources were used for training. Rao only clarified that in training DBRX he did not use Databricks customer data.

“We trained DBRX on large datasets from a variety of sources,” he added. “We used open data sets that the community knows, loves, and uses every day.”

I asked Rao if any of the DBRX training datasets were copyrighted or licensed, or if there were any obvious signs of bias (such as racial bias). , he didn't answer directly, just said: He then conducted a red team exercise to improve the model's weaknesses. ” Generative AI models are prone to backflow of training data, which is a major concern for commercial users of models trained on unlicensed, copyrighted, or clearly biased data. In the worst-case scenario, users can unknowingly introduce IP infringement or biased work from their models into their projects, leading to ethical and legal issues.

Some companies that train and release generative AI models offer policies that cover legal costs resulting from potential infringement. Databricks currently does not. Rao said the company is “exploring scenarios” where that could happen.

Considering this and other aspects where DBRX misses the mark, this model seems like a tough sell to anyone other than current or future Databricks customers. Databricks' competitors in generative AI, including OpenAI, offer comparable, if not more attractive, technology at very competitive prices. And many generative AI models are closer to the commonly understood open source definition than DBRX.

Rao promises that Databricks will continue to improve DBRX and release new versions as the company's Mosaic Labs R&D team (the team behind DBRX) explores new generative AI avenues.

“DBRX is advancing the field of open source models and challenging ourselves to build future models more efficiently,” he said. “We will continue to release variants as we apply techniques that improve output quality in terms of reliability, safety, and bias. The open model is a platform where customers can build custom functionality using our tools. I think there is.”

Judging by DBRX's current position relative to its peers, it has a very long road ahead.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

OpenAI seeks to extend human lifespans with the help of longevity startups

January 17, 2025

Farewell to the $200 million woolly mammoth and TikTok

January 17, 2025

Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

January 17, 2025

Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

January 16, 2025

Apple suspends AI notification summaries for news after generating false alerts

January 16, 2025

Nvidia releases more tools and guardrails to help enterprises adopt AI agents

January 16, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

Apple brings emergency satellite functionality to iPhone 13 with iOS 18.5

May 12, 2025

Even the A16Z VC says no one really knows what an AI agent is

May 12, 2025

Mercury CEO formalizes bets on early stage founders with a $26 million fund

May 12, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.