Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Alexa Von Tobel has high expectations for “Fintech 3.0”

June 15, 2025

How to delete 23andMe data

June 14, 2025

New AI-generated tags in the App Store are in beta

June 14, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    New AI-generated tags in the App Store are in beta

    June 14, 2025

    Google Tests the Audio Summary for Search Queries

    June 13, 2025

    Beyond Bluesky: These are the apps building social experiences on the AT Protocol

    June 13, 2025

    Bluesky Backlash misses points

    June 12, 2025

    Google Cloud Outages bring a lot of internet

    June 12, 2025
  • Crypto

    xNotify Polymarket as partner in the official forecast market

    June 6, 2025

    Circle IPOs are giving hope to more startups waiting to be published to more startups

    June 5, 2025

    GameStop bought $500 million in Bitcoin

    May 28, 2025

    Vote for the session you want to watch in 2025

    May 26, 2025

    Save $900 + 90% from 2 tickets to destroy 2025 in the last 24 hours

    May 25, 2025
  • Security

    How to delete 23andMe data

    June 14, 2025

    Anne Wojcicki's nonprofit reaches a deal to win 23andMe

    June 14, 2025

    Apple fixes new iPhone Zero Day bugs used in Paragon Spyware Hacks

    June 12, 2025

    Researchers confirm that two journalists have been hacked with Paragon Spyware

    June 12, 2025

    US government vaccine websites have been tainted with content generated by AI

    June 11, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    Alexa Von Tobel has high expectations for “Fintech 3.0”

    June 15, 2025

    Investor Experience with TechCrunch All Stages: 1 Floor, Endless Trading Flow

    June 14, 2025

    New details appear on the scale of Meta's $14.3 billion contract

    June 13, 2025

    Founder Experience at TechCrunch All Stage: Building for those who build the following

    June 13, 2025

    11 startups from YC demo day that investors talk about

    June 13, 2025
TechBrunchTechBrunch

Fairgen uses synthetic data and AI-generated answers to 'enhance' survey results

TechBrunchBy TechBrunchMay 9, 20248 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


Surveys have been used since time immemorial to gain insights about populations, products, and public opinion. And while methodologies may have changed over the millennia, one thing remains the same. That means we need people, lots of people.

But what if you can't find enough people to build a sample group large enough to produce meaningful results? Or, even though you might be able to find enough people? First, what if budget constraints limit the amount of talent you can source and interview?

This is where Fairgen wants to help. An Israeli startup today launched a platform that uses “statistical AI” to generate synthetic data that it says is just as good as the real thing. The company also announced $5.5 million in new funding from Maverick Ventures Israel, The Creator Fund, Tal Ventures, Ignia, and a handful of angel investors, bringing total cash raised since inception to $8 million. .

“Fake data”

Data may be the lifeblood of AI, but it will also forever be the cornerstone of market research. So when two worlds collide, as he does in Fairgen's world, the need for high-quality data becomes a little more pronounced.

Founded in Tel Aviv, Israel in 2021, Fairgen was previously focused on tackling bias in AI. However, in late 2022, the company pivoted to a new product, Fairboost, which is currently launching from beta.

Fairboost promises to “boost” small datasets up to 3x, allowing you to target niches that might otherwise be too difficult or too expensive to reach. Allows for more detailed insight into the field. It allows companies to use statistical AI learning patterns across different research segments to train deep machine learning models for each dataset they upload to the Fairgen platform.

The concept of “synthetic data” (data created artificially rather than from real-world events) is not new. Its roots go back to the early days of computing, where it was used to test software and algorithms and simulate processes. However, as we understand today, synthetic data has taken on a life of its own and is increasingly used to train models, especially with the advent of machine learning. Using artificially generated data that does not contain sensitive information can address both data scarcity issues and data privacy concerns.

Fairgen is the latest startup to test synthetic data and is primarily targeting market research. It's worth noting that Fairgen isn't generating data out of thin air or throwing millions of past studies into an AI-powered melting pot. Market researchers must conduct research on a small sample of the target market, and from there Fairgen establishes patterns and expands the sample. The company states that at least for the original sample he can guarantee a 2x boost, but on average he can achieve a 3x boost.

In this way, Fairgen could prove that people of a certain age and/or income level tend to answer questions in a certain way. Or, combine any number of data points and extrapolate from the original data set. It's essentially creating what Samuel Cohen, Fairgen's co-founder and CEO, calls “stronger, more robust data segments with less error.”

“The main realization was that people are becoming more and more diverse. Brands need to adapt to that and understand their customer segments,” Cohen explained to TechCrunch. “The segments are very different. Gen Z thinks differently than older adults. And it would take a lot of money and a lot of time and operational resources to be able to understand this market at a segment level. And we realized that was the problem, and that's where synthetic data played a role.”

The obvious criticism, which the company acknowledges and disputes, is that this all sounds like a huge shortcut to getting out there and interviewing real people and gathering real opinions.

Certainly, underrepresented groups should be concerned that their real voices are being replaced by, well, false voices.

“Every customer we talk to in the research space has a huge blind spot, an audience that is completely difficult to reach,” Fernando Zatz, head of growth at Fairgen, told TechCrunch. Ta. “The reason they're not actually selling projects is because there's a lack of talent, especially in a world where markets are fragmented and increasingly diverse. Sometimes they can't go to certain countries. Since you can't target a specific demographic, you'll actually end up losing money on your project by not meeting your quota. [of respondents]And if that number is not reached, the insights will not be sold. ”

Fairgen is not the only company applying generative AI to the market research space. Qualtrics announced last year that it would invest $500 million over four years to bring generative AI to its platform, essentially focusing on qualitative research. But this is further evidence that synthetic data exists and will continue to exist.

However, validating the results plays an important role in convincing people that this is genuine and not a cost-cutting measure that will produce optimal results. Fairgen does this by comparing “real” sample boosts to “synthetic” sample boosts. Take a small sample of the dataset, extrapolate it, and line it up with the real thing.

“We do these exact same types of tests on every customer we sign up with,” Cohen said.

statistically speaking

Cohen holds a Master's degree in Statistical Science from the University of Oxford and a PhD in Machine Learning from UCL, London. As part of this, he spent nine months working as a research scientist at Meta.

One of the company's co-founders is chairman Benny Schneider, who previously worked in enterprise software and whose name has been withdrawn four times. In 2008 he exited Qumranet and sold it to Red Hat for $107 million. In 2004 he sold P-Cube to Cisco for $200 million. Then, in 2000, Pentacom was acquired by Cisco for $118.

and Emmanuel Candes, a professor of statistics and electrical engineering at Stanford University, is Fairgen's chief scientific advisor.

This business and mathematical backbone is a big selling point for companies trying to convince the world that fake data, if applied correctly, is every bit as good as real data. This is also a way to clearly articulate the thresholds and limits of the technology, i.e. how large a sample is required to achieve optimal boost.

Ideally, a survey should have at least 300 actual respondents, Cohen said, and from there Fairboost can expand the segment size to no more than 15% of the broader survey.

“If it's less than 15%, we can guarantee an average improvement of 3x based on hundreds of parallel tests,” Cohen said. “Statistically, above 15%, the increase is not very dramatic. The data already shows a good confidence level, and the synthetic respondents could potentially match them or have a slight increase. On the business side, there's nothing wrong with anything above 15%. Brands can already learn from these groups. They're just at a niche level.”

Factors not to use LLM

It's worth noting that Fairgen doesn't use large-scale language models (LLMs), and its platform doesn't produce “plain English” responses like ChatGPT. The reason is that LLM uses learning from countless other data sources besides the parameters of the study, increasing the likelihood of introducing biases that are incompatible with quantitative research.

Fairgen is all about statistical models and tabular data, and its training relies solely on the data contained within the uploaded dataset. This effectively allows market researchers to extrapolate from adjacent segments within a survey to generate new synthetic respondents.

“We don’t use LLMs for a very simple reason: If you pre-train with a large amount of LLMs, [other] If you do an investigation, it will only give you false information,'' Cohen said. “There may be cases where another investigation turns up something, but we don't want that. It's all about credibility.”

In terms of business model, Fairgen is sold as a SaaS, where businesses upload their surveys in a structured format (.CSV or .SAV) to Fairgen's cloud-based platform. Depending on the number of questions, it can take up to 20 minutes to train a model based on survey data given, Cohen said. Then, when the user selects a “segment” (a subset of respondents that shares certain characteristics) (for example, “Gen Z working in industry x”), Fairgen creates a segment with exactly the same structure as the original training file. Deliver new files. Question, just a new line.

Fairgen is used by BVA and French polling and market research firm IFOP, both of which have already integrated the startup's technology into their services. IFOP, which is a bit like America's Gallup, is using Fairgen for polling purposes in the European elections, but Cohen believes it could eventually be used in the US elections later this year. There is.

“IFOP is basically our stamp of approval because IFOP has been around for about 100 years,” Cohen said. “They validated the technology and were our original design partner. We are also testing or have already integrated with some of the largest market research companies in the world, but we have yet to talk about that.” I can not do it.”



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Alexa Von Tobel has high expectations for “Fintech 3.0”

June 15, 2025

Investor Experience with TechCrunch All Stages: 1 Floor, Endless Trading Flow

June 14, 2025

New details appear on the scale of Meta's $14.3 billion contract

June 13, 2025

Founder Experience at TechCrunch All Stage: Building for those who build the following

June 13, 2025

11 startups from YC demo day that investors talk about

June 13, 2025

ICONIQ VCS courted the chime for two years and the company has not sold its shares

June 13, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

Alexa Von Tobel has high expectations for “Fintech 3.0”

June 15, 2025

How to delete 23andMe data

June 14, 2025

New AI-generated tags in the App Store are in beta

June 14, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.