Over a million developers have joined DZone.

Researchers Use Big Data to Gauge the Success of a Startup

DZone's Guide to

Researchers Use Big Data to Gauge the Success of a Startup

Test4startup uses an algorithm that was developed in conjunction with a university professor to supposedly test the prospects of a startup idea.

· Big Data Zone
Free Resource

Effortlessly power IoT, predictive analytics, and machine learning applications with an elastic, resilient data infrastructure. Learn how with Mesosphere DC/OS.


The past year has seen a number of projects emerge to try and help decipher whether a startup has what it takes to grow into something substantial.

I wrote about one such project earlier this year. The project, called Test4startup, uses an algorithm that was developed in conjunction with a university professor to supposedly test the prospects of a startup idea.

Or you have French company Early Metrics, who are like a ratings agency for startups.  They aim to examine a number of key factors before providing a single measure to highlight their potential.

Big Data and Text Mining

The latest effort comes from a researcher at the University of Texas at Arlington, who utilizes big data analytics to improve the market intelligence we have around startups in the tech world.

“Industry giants like Google, Microsoft and Yahoo are spending tens of billions of dollars a year on acquiring smaller firms for market entrance, strategic intellectual property and talented employees, but face a real challenge identifying companies with the right products or technology in the vast startup universe,”the researchers say.

“Our new approach uses big data analytics and a text-mining technique called topic modeling to identify potential matches,” they continue. “By analyzing unstructured, publicly available descriptions of any startups’ business, we can quantify any two firms’ business, geographic, investor and social proximity and from there identify potential targets for mergers and acquisitions.”

Topic Intelligence

The researchers have boiled their findings into a new company, called Topic Technologies, which uses their model to provide market intelligence to clients in the high-tech world.

The model was initially based upon publicly available information from Crunchbase, with data from some 24,000 privately held companies mined from the site.  The database contained the location of the headquarters, the industry sector, cofounders, key employees, board members, any investments they’d received and a description of each business.

The team used topic modeling to analyze the language used in these descriptions to determine the proximity of each startup to other companies.  This proximity rating, alongside factors such as location and social ties between key staff, was then used to gauge the possible success of any merger between two companies.

“This data-driven, analytics-based approach has proved effective in explaining mergers and acquisitions in the startup world and complements existing toolkits for measuring business proximity,” the researchers say. “Our system is particularly appropriate when the firms under study are small and privately held so industry classification in largely unavailable, which is the case for startups.”

Learn to design and build better data-rich applications with this free eBook from O’Reilly. Brought to you by Mesosphere DC/OS.

big data analytics ,topic modeling ,intelligence ,analytics ,metrics ,companies ,startup ,property ,information

Published at DZone with permission of Adi Gaskell, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.


Dev Resources & Solutions Straight to Your Inbox

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.


{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}