Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Coinbase CEO explains why he fired an engineer who didn't try AI right away

August 22, 2025

BlueSky blocks Mississippi services across age guarantee laws

August 22, 2025

Tiktok denies India's comeback after reporting that the website has been published

August 22, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    BlueSky blocks Mississippi services across age guarantee laws

    August 22, 2025

    Tiktok denies India's comeback after reporting that the website has been published

    August 22, 2025

    Google makes it easier to edit drive videos with the new VIDS shortcut button

    August 22, 2025

    X brings out the ability to like and follow the free tier of developer APIs

    August 22, 2025

    As India bans real money games, dream sports, MPL starts pulling plugs

    August 21, 2025
  • Crypto

    Coinbase CEO explains why he fired an engineer who didn't try AI right away

    August 22, 2025

    Your next customer is destroying the 2025 Expo floor

    August 19, 2025

    Crypto Company Gemini File for Winklevoss Twins IPO

    August 16, 2025

    North Korean spies pretending to be remote workers have invaded hundreds of businesses, CloudStrike says

    August 4, 2025

    Telegram's Crypto Wallet will be released in the US

    July 22, 2025
  • Security

    Developers get prison time to disrupt the ex-employer's network with “kill switch”

    August 22, 2025

    Explain why hackers who exposed the North Korean government did that

    August 21, 2025

    Device searches at US borders hit record-breaking records, new data show

    August 20, 2025

    Listen and record all conversations “Always On” Harvard Dropout launches AI smart glasses

    August 20, 2025

    New Zero-Day startup offers $20 million for a tool that can hack your smartphone

    August 20, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    Y Combinator says Apple's App Store is hampering startup growth

    August 22, 2025

    Beanie baby in the brain rot era

    August 22, 2025

    Procuring multiple rounds of venture capital could be wrong for your startup

    August 21, 2025

    Strictlyvc at atrupt 2025: Inside the LP track

    August 21, 2025

    Even Rogers and Max Haot will take part in the Space Stage in 2025

    August 20, 2025
TechBrunchTechBrunch

StarCoder 2 is a code generation AI that can run on most GPUs

TechBrunchBy TechBrunchFebruary 28, 20247 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


Developers are adopting AI-powered code generators (services like GitHub Copilot and Amazon CodeWhisperer, and open access models like Meta's CodeLlama) at an astonishing rate. However, the tool is far from ideal. Many are not free. Others are available, but only under licenses that prohibit their use in general commercial contexts.

Recognizing the demand for alternatives, several years ago AI startup Hugging Face partnered with workflow automation platform ServiceNow to create StarCoder, an open source code generator with a less restrictive license than others. Did. The original was released online early last year, and a sequel, StarCoder 2, has been in development ever since.

StarCoder 2 is not a single code generation model, but a family. There are three versions of this version released today, the first two of which can run on modern consumer GPUs.

  • 3 billion parameter (3B) model trained by ServiceNow
  • 7 billion parameter (7B) model trained by Hugging Face
  • A 15 billion parameter (15B) model trained by Nvidia, the newest supporter of the StarCoder project.

(Note that “parameters” are the parts of the model learned from the training data that essentially define the model's skill for the problem, and in this case, the code it generates.)

Like most other code generators, StarCoder 2 can suggest ways to complete unfinished lines of code or retrieve summarized snippets of code in response to questions in natural language. Trained on four times the data of the original StarCoder, StarCoder 2 offers what Hugging Face, ServiceNow, and Nvidia characterize as “significantly” improved performance at lower operating costs.

StarCoder 2 can use GPUs like the Nvidia A100 to fine-tune first-party or third-party data in “hours” to create apps like chatbots and personal coding assistants. StarCoder 2 was also trained on a larger and more diverse dataset than the original StarCoder (approximately 619 programming languages), so it can, at least hypothetically, make more accurate and context-aware predictions.

“StarCoder 2 was created specifically for developers who need to build applications quickly,” Harm de Vries, head of ServiceNow's StarCoder 2 development team, told TechCrunch in an interview. “StarCoder2 allows developers to use its features to code more efficiently without sacrificing speed or quality.”

Now, I would venture to say that not all developers agree with De Vries in terms of speed and quality. Code generators promise to streamline certain coding tasks, but they come at a cost.

A recent study from Stanford University found that engineers who use code generation systems are more likely to introduce security vulnerabilities into the apps they develop. Additionally, a poll by cybersecurity firm Sonatype found that a majority of developers felt that code generators lack insight into how code is generated, and that generators generate unmanageable amounts of code. It has been shown that they are concerned about “code sprawl.”

The StarCoder 2 license may also be an obstacle for some people.

StarCoder 2 is licensed under Hugging Face's RAIL-M, which is intended to encourage responsible use by imposing “light touch” restrictions on both model licensees and downstream users. Although RAIL-M is less restrictive than many other licenses, it is not truly “open” in the sense that it does not allow a developer to use his StarCoder 2 for . every Possible applications (for example, apps that provide medical advice are strictly off-limits). Some commentators have said that RAIL-M's requirements may be too vague to comply with in any case, and that RAIL-M may conflict with AI-related regulations such as EU AI law. I am.

Putting all this aside for a moment, is StarCoder 2 really better than other code generators, free or paid?

In some benchmarks, it appears to be more efficient than one of CodeLlama's versions, CodeLlama 33B. According to Hugging Face, StarCoder 2 15B matches CodeLlama 33B twice as fast in a subset of code completion tasks. It's not clear which task. Hug face is not specified.

As an open source collection of models, StarCoder 2 also has the advantage of being able to be deployed locally to “study” the developer's source code or codebase. It's an attractive prospect for developers and companies wary of exposing code to cloud-hosted AI. His 2023 study by Portal26 and Censuswide found that 85% of companies are choosing not to use code generators due to privacy and security risks, such as having employees share sensitive information or vendors training on proprietary data. responded that he is cautious about introducing GenAI.

Hugging Face, ServiceNow, and Nvidia also claim that StarCoder 2 is more ethical and has fewer legal issues than its competitors.

All GenAI models regurgitate. In other words, it spits out a mirror copy of the data used for training. It doesn't take an active imagination to understand why this puts developers in trouble. If you use a code generator that has been trained on copyrighted code, even with filters and additional safeguards, the generator will unknowingly recommend copyrighted code and add that code to the code. may not be able to be labeled as such.

Several vendors, including GitHub, Microsoft (GitHub's parent company), and Amazon, have committed to providing legal protection if their code generator customers are accused of copyright infringement. However, coverage varies by vendor and is typically limited to corporate customers.

In contrast to code generators trained using copyrighted code (particularly GitHub Copilot), StarCoder 2 is based on a license from Software Heritage, a nonprofit organization that provides code archiving services. Trained on data only. Prior to StarCoder 2 training, BigCode, the cross-organizational team behind much of the StarCoder 2 roadmap, gave code owners the opportunity to opt out of the training set if they wished.

Like the original StarCoder, StarCoder 2's training data is available for developers to fork, clone, and audit as needed.

Leandro von Werra, Hugging Face Machine Learning Engineer and Co-Leader of BigCode, says that while open code generators have proliferated recently, they are not accompanied by information about the data used to train them or how they are actually done. He pointed out that there are very few. they were trained.

“From a scientific point of view, the problem is that the training is not reproducible, but even as data creators (i.e. people who upload code to GitHub), you have no control over whether or not your data was used and how. We do not know whether it was used in interview. “StarCoder 2 addresses this issue by being completely transparent throughout the training pipeline, from scraping pre-training data to training itself.”

However, StarCoder 2 isn't perfect. Like any code generator, it is susceptible to bias. De Vries points out that codes can be generated that include elements that reflect gender and racial stereotypes. Additionally, StarCoder 2 is trained primarily on English comments, Python, and Java code, so it performs poorly in non-English languages ​​and “low resource” code such as Fortran and Haksell.

Still, von Werra insists this is a step in the right direction.

“We strongly believe that building trust and accountability in AI models requires transparency and auditability of the complete model pipeline, including training data and training recipes,” he said. Told. “Star Coder 2” [showcases] How can a fully open model achieve competitive performance? ”

Like this author, you may be wondering what motivates Hugging Face, ServiceNow, and Nvidia to invest in a project like StarCoder 2. They are a business, after all, and training models don't come cheap.

As far as I know, this is a proven strategy. That means promoting goodwill and building paid services on top of open source releases.

ServiceNow already uses StarCoder to create Now LLM. It's a code generation product that's fine-tuned to ServiceNow's workflow patterns, use cases, and processes. Hugging Face, which offers model implementation consulting plans, offers a hosted version of his StarCoder 2 model on its platform. Nvidia is doing the same, making StarCoder 2 available through an API and web front end.

Developers specifically interested in a free offline experience can download StarCoder 2 (models, source code, etc.) from the project's GitHub page.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

OpenAI seeks to extend human lifespans with the help of longevity startups

January 17, 2025

Farewell to the $200 million woolly mammoth and TikTok

January 17, 2025

Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

January 17, 2025

Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

January 16, 2025

Apple suspends AI notification summaries for news after generating false alerts

January 16, 2025

Nvidia releases more tools and guardrails to help enterprises adopt AI agents

January 16, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

Coinbase CEO explains why he fired an engineer who didn't try AI right away

August 22, 2025

BlueSky blocks Mississippi services across age guarantee laws

August 22, 2025

Tiktok denies India's comeback after reporting that the website has been published

August 22, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.