Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Why investing in a growing AI startup is risky and more complicated

June 6, 2025

Humanity appoints national security experts to governing trusts

June 6, 2025

Italian lawmakers say Italy used spyware to target immigrant activists' mobile phones, but not for journalists

June 6, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    Trump Mask feud was perfect for X and jumped on the app store chart

    June 6, 2025

    iOS 19: All the rumor changes that Apple could bring to the new operating system

    June 6, 2025

    WWDC 2025: What to expect from this year's meeting

    June 6, 2025

    The court denied requests to suspend awards regarding Apple's App Store payment fees

    June 6, 2025

    Perplexity received 780 million questions last month, the CEO says

    June 5, 2025
  • Crypto

    xNotify Polymarket as partner in the official forecast market

    June 6, 2025

    Circle IPOs are giving hope to more startups waiting to be published to more startups

    June 5, 2025

    GameStop bought $500 million in Bitcoin

    May 28, 2025

    Vote for the session you want to watch in 2025

    May 26, 2025

    Save $900 + 90% from 2 tickets to destroy 2025 in the last 24 hours

    May 25, 2025
  • Security

    Humanity appoints national security experts to governing trusts

    June 6, 2025

    Italian lawmakers say Italy used spyware to target immigrant activists' mobile phones, but not for journalists

    June 6, 2025

    Humanity unveils custom AI models for US national security customers

    June 5, 2025

    Unlock phone company Cellebrite to acquire mobile testing startup Corellium for $170 million

    June 5, 2025

    Ransomware Gangs claim responsibility for Kettering Health Hack

    June 4, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    Why investing in a growing AI startup is risky and more complicated

    June 6, 2025

    Startup Battlefield 200: Only 3 days left

    June 6, 2025

    Book all TC Stage Exhibitor Tables before ending today

    June 6, 2025

    Less than 48 hours left until display at TC at all stages

    June 5, 2025

    TC Session: AI will be on sale today at Berkeley

    June 5, 2025
TechBrunchTechBrunch

StarCoder 2 is a code generation AI that can run on most GPUs

TechBrunchBy TechBrunchFebruary 28, 20247 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


Developers are adopting AI-powered code generators (services like GitHub Copilot and Amazon CodeWhisperer, and open access models like Meta's CodeLlama) at an astonishing rate. However, the tool is far from ideal. Many are not free. Others are available, but only under licenses that prohibit their use in general commercial contexts.

Recognizing the demand for alternatives, several years ago AI startup Hugging Face partnered with workflow automation platform ServiceNow to create StarCoder, an open source code generator with a less restrictive license than others. Did. The original was released online early last year, and a sequel, StarCoder 2, has been in development ever since.

StarCoder 2 is not a single code generation model, but a family. There are three versions of this version released today, the first two of which can run on modern consumer GPUs.

  • 3 billion parameter (3B) model trained by ServiceNow
  • 7 billion parameter (7B) model trained by Hugging Face
  • A 15 billion parameter (15B) model trained by Nvidia, the newest supporter of the StarCoder project.

(Note that “parameters” are the parts of the model learned from the training data that essentially define the model's skill for the problem, and in this case, the code it generates.)

Like most other code generators, StarCoder 2 can suggest ways to complete unfinished lines of code or retrieve summarized snippets of code in response to questions in natural language. Trained on four times the data of the original StarCoder, StarCoder 2 offers what Hugging Face, ServiceNow, and Nvidia characterize as “significantly” improved performance at lower operating costs.

StarCoder 2 can use GPUs like the Nvidia A100 to fine-tune first-party or third-party data in “hours” to create apps like chatbots and personal coding assistants. StarCoder 2 was also trained on a larger and more diverse dataset than the original StarCoder (approximately 619 programming languages), so it can, at least hypothetically, make more accurate and context-aware predictions.

“StarCoder 2 was created specifically for developers who need to build applications quickly,” Harm de Vries, head of ServiceNow's StarCoder 2 development team, told TechCrunch in an interview. “StarCoder2 allows developers to use its features to code more efficiently without sacrificing speed or quality.”

Now, I would venture to say that not all developers agree with De Vries in terms of speed and quality. Code generators promise to streamline certain coding tasks, but they come at a cost.

A recent study from Stanford University found that engineers who use code generation systems are more likely to introduce security vulnerabilities into the apps they develop. Additionally, a poll by cybersecurity firm Sonatype found that a majority of developers felt that code generators lack insight into how code is generated, and that generators generate unmanageable amounts of code. It has been shown that they are concerned about “code sprawl.”

The StarCoder 2 license may also be an obstacle for some people.

StarCoder 2 is licensed under Hugging Face's RAIL-M, which is intended to encourage responsible use by imposing “light touch” restrictions on both model licensees and downstream users. Although RAIL-M is less restrictive than many other licenses, it is not truly “open” in the sense that it does not allow a developer to use his StarCoder 2 for . every Possible applications (for example, apps that provide medical advice are strictly off-limits). Some commentators have said that RAIL-M's requirements may be too vague to comply with in any case, and that RAIL-M may conflict with AI-related regulations such as EU AI law. I am.

Putting all this aside for a moment, is StarCoder 2 really better than other code generators, free or paid?

In some benchmarks, it appears to be more efficient than one of CodeLlama's versions, CodeLlama 33B. According to Hugging Face, StarCoder 2 15B matches CodeLlama 33B twice as fast in a subset of code completion tasks. It's not clear which task. Hug face is not specified.

As an open source collection of models, StarCoder 2 also has the advantage of being able to be deployed locally to “study” the developer's source code or codebase. It's an attractive prospect for developers and companies wary of exposing code to cloud-hosted AI. His 2023 study by Portal26 and Censuswide found that 85% of companies are choosing not to use code generators due to privacy and security risks, such as having employees share sensitive information or vendors training on proprietary data. responded that he is cautious about introducing GenAI.

Hugging Face, ServiceNow, and Nvidia also claim that StarCoder 2 is more ethical and has fewer legal issues than its competitors.

All GenAI models regurgitate. In other words, it spits out a mirror copy of the data used for training. It doesn't take an active imagination to understand why this puts developers in trouble. If you use a code generator that has been trained on copyrighted code, even with filters and additional safeguards, the generator will unknowingly recommend copyrighted code and add that code to the code. may not be able to be labeled as such.

Several vendors, including GitHub, Microsoft (GitHub's parent company), and Amazon, have committed to providing legal protection if their code generator customers are accused of copyright infringement. However, coverage varies by vendor and is typically limited to corporate customers.

In contrast to code generators trained using copyrighted code (particularly GitHub Copilot), StarCoder 2 is based on a license from Software Heritage, a nonprofit organization that provides code archiving services. Trained on data only. Prior to StarCoder 2 training, BigCode, the cross-organizational team behind much of the StarCoder 2 roadmap, gave code owners the opportunity to opt out of the training set if they wished.

Like the original StarCoder, StarCoder 2's training data is available for developers to fork, clone, and audit as needed.

Leandro von Werra, Hugging Face Machine Learning Engineer and Co-Leader of BigCode, says that while open code generators have proliferated recently, they are not accompanied by information about the data used to train them or how they are actually done. He pointed out that there are very few. they were trained.

“From a scientific point of view, the problem is that the training is not reproducible, but even as data creators (i.e. people who upload code to GitHub), you have no control over whether or not your data was used and how. We do not know whether it was used in interview. “StarCoder 2 addresses this issue by being completely transparent throughout the training pipeline, from scraping pre-training data to training itself.”

However, StarCoder 2 isn't perfect. Like any code generator, it is susceptible to bias. De Vries points out that codes can be generated that include elements that reflect gender and racial stereotypes. Additionally, StarCoder 2 is trained primarily on English comments, Python, and Java code, so it performs poorly in non-English languages ​​and “low resource” code such as Fortran and Haksell.

Still, von Werra insists this is a step in the right direction.

“We strongly believe that building trust and accountability in AI models requires transparency and auditability of the complete model pipeline, including training data and training recipes,” he said. Told. “Star Coder 2” [showcases] How can a fully open model achieve competitive performance? ”

Like this author, you may be wondering what motivates Hugging Face, ServiceNow, and Nvidia to invest in a project like StarCoder 2. They are a business, after all, and training models don't come cheap.

As far as I know, this is a proven strategy. That means promoting goodwill and building paid services on top of open source releases.

ServiceNow already uses StarCoder to create Now LLM. It's a code generation product that's fine-tuned to ServiceNow's workflow patterns, use cases, and processes. Hugging Face, which offers model implementation consulting plans, offers a hosted version of his StarCoder 2 model on its platform. Nvidia is doing the same, making StarCoder 2 available through an API and web front end.

Developers specifically interested in a free offline experience can download StarCoder 2 (models, source code, etc.) from the project's GitHub page.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

OpenAI seeks to extend human lifespans with the help of longevity startups

January 17, 2025

Farewell to the $200 million woolly mammoth and TikTok

January 17, 2025

Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

January 17, 2025

Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

January 16, 2025

Apple suspends AI notification summaries for news after generating false alerts

January 16, 2025

Nvidia releases more tools and guardrails to help enterprises adopt AI agents

January 16, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

Why investing in a growing AI startup is risky and more complicated

June 6, 2025

Humanity appoints national security experts to governing trusts

June 6, 2025

Italian lawmakers say Italy used spyware to target immigrant activists' mobile phones, but not for journalists

June 6, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.