How to predict the success or failure of a new retail business

  • October 9, 2018
How to predict the success or failure of a new retail business

Krittika D'Silva leads study which uses social media and transport data to predict a new retail business' likelihood of success.

One of the most important questions for any new business is the amount of demand it will receive.

Krittika D'Silva

Researchers led by Gates Cambridge Scholar Krittika D'Silva have used a combination of social media and transport data to predict the likelihood that a given retail business will succeed or fail. 

Using information from 10 different cities around the world, the researchers, led by the University of Cambridge, have developed a model that can predict with 80% accuracy whether a new business will fail within six months. The results will be presented at the ACM Conference on Pervasive and Ubiquitous Computing (Ubicomp), taking place this week in Singapore.

While the retail sector has always been risky, the past several years have seen a transformation of high streets as more and more retailers fail. The model built by the researchers could be useful for both entrepreneurs and urban planners when determining where to locate their business or which areas to invest in.

“One of the most important questions for any new business is the amount of demand it will receive. This directly relates to how likely that business is to succeed,” said lead author Krittika [2016], a PhD student at Cambridge's Department of Computer Science and Technology. “What sort of metrics can we use to make those predictions?”

D’Silva and her colleagues used more than 74 million check-ins from the location-based social network Foursquare from Chicago, Helsinki, Jakarta, London, Los Angeles, New York, Paris, San Francisco, Singapore and Tokyo; and data from 181 million taxi trips from New York and Singapore.

Using this data, the researchers classified venues according to the properties of the neighbourhoods in which they were located, the visit patterns at different times of day, and whether a neighbourhood attracted visitors from other neighbourhoods.

“We wanted to better understand the predictive power that metrics about a place at a certain point in time have,” said Krittika.

Whether a business succeeds or fails is normally based on a number of controllable and uncontrollable factors. Controllable factors might include the quality or price of the store’s product, its opening hours and its customer satisfaction. Uncontrollable factors might include unemployment rates of a city, overall economic conditions and urban policies.

“We found that even without information about any of these uncontrollable factors, we could still use venue-specific, location-related and mobility-based features in predicting the likely demise of a business,” said D’Silva.

The data showed that across all 10 cities, venues that are popular around the clock, rather than just at certain points of day, are more likely to succeed. Additionally, venues that are in demand outside of the typical popular hours of other venues in the neighbourhood tend to survive longer. The data also suggested that venues in diverse neighbourhoods, with multiple types of businesses, tend to survive longer.

While the ten cities had certain similarities, the researchers also had to account for their differences.

“The metrics that were useful predictors vary from city to city, which suggests that factors affect cities in different ways,” said Krittika. “As one example, that the speed of travel to a venue is a significant metric only in New York and Tokyo. This could relate to the speed of transit in those cities or perhaps to the rates of traffic.”

To test the predictive power of their model, the researchers first had to determine whether a particular venue had closed within the time window of their data set. They then ‘trained’ the model on a subset of venues, telling the model what the features of those venues were in the first time window and whether the venue was open or closed in a second time window. They then tested the trained model on another subset of the data to see how accurate it was.

According to the researchers, their model shows that when deciding when and where to open a business, it is important to look beyond the static features of a given neighbourhood and to consider the ways that people move to and through that neighbourhood at different times of day. They now want to consider how these features vary across different neighbourhoods in order to improve the accuracy of their model.

*Photo of Regent Street. Credit: toastbrot81

Krittika D'Silva

Krittika D'Silva

  • Alumni
  • Canada
  • 2016 PhD Computer Science
  • Jesus College

As an undergraduate at the University of Washington, I majored in Bioengineering and Computer Engineering. I worked in three research labs building technology for individuals with lower limb amputations, mobile software for low resource settings, and DNA molecules for long-term data storage. I believe phones can be a valuable tool for change and I look forward to continuing research in mobile systems at Cambridge.

Previous Education

University of Washington

Latest News

Olympic opening ceremony harks back to tradition of ‘liquid streets’

The opening ceremony of the 2024 Olympic Games today will see athletes from around the world cross the centre of Paris on boats, navigating the waters of the river Seine, using it and its banks as life-size stages. Although the ceremony is being billed as innovative, it is in fact part of a centuries-old tradition […]

Why AI needs to be inclusive

When Hannah Claus [2024] studied computer science at school she soon realised that she was in a room full of white boys, looking at posters of white men. “I could not see myself in that,” she says. “I realised there were no role models to follow and that I had to become that myself. There […]

New book deal for Gates Cambridge Scholar

A Gates Cambridge Scholar has signed a deal to write a book on Indigenous climate justice. The Longest Night will be published by Atria Books, part of Simon & Schuster, and was selected as the deal of the day by Publishers Marketplace earlier this week. Described as “a stunning exploration of the High North and […]

Why understanding risk for different populations can reduce cardiovascular deaths

The incidence of cardiovascular disease (CVD) – the number one cause of death globally – can be reduced significantly by understanding the risk faced by different populations better, according to a new study. Identifying individuals at high risk and intervening to reduce risk before an event occurs underpins the majority of national and international primary […]