Identifying Key Features in Yelp Data for Success in Different Types of Restaurants
DOI:
https://doi.org/10.29407/intensif.v9i1.23476Keywords:
Yelp Dataset Challenge, Linear Square Regression, Restaurant AttributesAbstract
Background: The purpose of this research is to measure of customer satisfaction for newly established independent restaurants and, consequently, good predictors of independent restaurant success. Urban communities face several challenges, including how to best use scarce resources like real estate and support small enterprises. Smart businesses are essential to the development of smart cities because they use data analytics to inform their strategic planning and design choices, and the target of this topic is restaurant. Objective: Restaurants control a sizable portion of the city market's small business sector. As part of the Yelp Data Challenge, Yelp just made available an open dataset that includes important details, ratings, and Yelp scores for every restaurant in different cities. Methods: Our methodology utilizes a vector of crucial factors to accurately forecast a business’s prospective success and exclusively evaluate eateries located inside the city limits of Las Vegas. The dependent variables will consist of the mean Yelp ratings for each restaurant and constructed our model by following the subsequent stages. Conclusion: The findings of this research is corroborated by the discovery that the statistically significant properties of restaurants, shown by a low p-value, varied across various restaurant categories, the unique modeling technique to forecast future restaurants' Yelp rankings based on their design choices. This will assist owners of restaurants in making better design choices, which will result in more prosperous small enterprises in urban settings.
Downloads
References
Yelp dataset challenge. Available: [Online]. Available: http://dx.doi.org/10.4225/13/511C71F86-12C3
S. National Restaurant Association et al., “Restaurant industry forecast,” 2016.
W. Farhan, “Predicting Yelp Restaurant Reviews,” UC San Diego, La Jolla, 2014.
S. W. I. Wahyudi, A. Affandi, and M. Hariadi, “Recommender engine using cosinesimilarity based on alternating least square-weight regularization,” in 2017 15th Quantum Information Research, 2017, pp. 1–6, doi: 10.1109/QIR.2017.8168492. DOI: https://doi.org/10.1109/QIR.2017.8168492
Y.-H. Hu, K. Chen, and P.-J. Lee, “The effect of user-controllable filters on the prediction of online hotel reviews,” Inf. Manag., vol. 54, no. 6, pp. 728–744, 2017, doi: 10.1016/j.im.2016.12.009. DOI: https://doi.org/10.1016/j.im.2016.12.009
A. Kong, V. Nguyen, and C. Xu, “Predicting International Restaurant Success with Yelp,” Stanford University, 2016.
M. Luca, “Reviews, reputation, and revenue: The case of yelp.com,” Harvard Business School NOM Unit Working Paper, no. 12-016, 2011. DOI: https://doi.org/10.2139/ssrn.1928601
L. Kwok and K. L. Xie, “Factors contributing to the helpfulness of online hotel reviews,” Int. J. Contemp. Hosp. Manag., vol. 28, no. 10, pp. 2156–2177, 2016, doi: 10.1108/IJCHM-03-2015-0107. DOI: https://doi.org/10.1108/IJCHM-03-2015-0107
E. S. Alamoudi and S. Al Azwari, “Exploratory Data Analysis and Data Mining on Yelp Restaurant Review,” in 2021 National Conference on Computing and Communication (NCCC), 2021, doi: 10.1109/NCCC49330.2021.9428850. DOI: https://doi.org/10.1109/NCCC49330.2021.9428850
H. Li, C. (R.) Wang, F. Meng, and Z. Zhang, “Making restaurant reviews useful and/or enjoyable? The impacts of temporal, explanatory, and sensory cues,” Int. J. Hosp. Manag., vol. 83, pp. 257–265, 2019, doi: 10.1016/j.ijhm.2018.11.002. DOI: https://doi.org/10.1016/j.ijhm.2018.11.002
Y. Chen and F. Xia, “Restaurants’ Rating Prediction Using Yelp Dataset,” in 2020 IEEE International Conference on Advances in Electrical Engineering and Computer Applications (AEECA), Dalian, China, 2020. DOI: https://doi.org/10.1109/AEECA49918.2020.9213704
M. R. D. Ching and R. de Dios Bulos, “Improving Restaurants’ Business Performance Using Yelp Data Sets through Sentiment Analysis,” in 2019 3rd International Conference on E-Commerce, E-Business and E-Government (ICEEG), 2019, pp. 62–67, doi: 10.1145/3340017.3340018. DOI: https://doi.org/10.1145/3340017.3340018
D. Keller and M. Kostromitina, “Characterizing non-chain restaurants’ Yelp star-ratings: Generalizable findings from a representative sample of Yelp reviews,” Int. J. Hosp. Manag., vol. 86, 2020, doi: 10.1016/j.ijhm.2019.102440. DOI: https://doi.org/10.1016/j.ijhm.2019.102440
J. Richards, S. Dabhi, F. Poursardar, and S. Jayarathna, “Poster: Leveraging Data Analysis and Machine Learning to Authenticate Yelp Reviews through User Metadata Patterns,” in 2023 ACM Conference on Computer and Communications Security (CCS), 2023, doi: 10.1145/3565287.3617983. DOI: https://doi.org/10.1145/3565287.3617983
H. Nakahara, “A naive approach to program extraction,” Publ. Res. Inst. Math. Sci., vol. 25, no. 3, 1990, doi: 10.2977/PRIMS/1195173352. DOI: https://doi.org/10.2977/prims/1195173352
F. Pérez-González and C. Troncoso, “Understanding Statistical Disclosure: A Least Squares approach,” in Lecture Notes in Computer Science, vol. 7384, 2012. DOI: https://doi.org/10.1007/978-3-642-31680-7_3
R. Giri, ., Rymmai, and J. S. Saleema, “Book Recommendation using Cosine Similarity,” Int. J. Adv. Res. Comput. Sci., vol. 8, no. 3, 2017, doi: 10.26483/IJARCS.V8I3.2995.
M. Sadikin and A. Fauzan, “Evaluation of Machine Learning Approach for Sentiment Analysis using Yelp Dataset,” Eur. J. Electr. Comput. Eng., vol. 7, no. 6, 2023, doi: 10.24018/ejece.2023.7.6.583. DOI: https://doi.org/10.24018/ejece.2023.7.6.583
Q. Xuan et al., “Modern Food Foraging Patterns: Geography and Cuisine Choices of Restaurant Patrons on Yelp,” IEEE Trans. Comput. Soc. Syst., vol. 5, no. 2, 2018, doi: 10.1109/TCSS.2018.2819659. DOI: https://doi.org/10.1109/TCSS.2018.2819659
M. Nakayama and Y. Wan, “The cultural impact on social commerce: A sentiment analysis on Yelp ethnic restaurant reviews,” Inf. Manag., 2019, doi: 10.1016/J.IM.2018.09.004. DOI: https://doi.org/10.1016/j.im.2018.09.004
C. Fu et al., “Link Weight Prediction Using Supervised Learning Methods and Its Application to Yelp Layered Network,” IEEE Trans. Knowl. Data Eng., vol. 30, no. 8, 2018, doi: 10.1109/TKDE.2018.2801854. DOI: https://doi.org/10.1109/TKDE.2018.2801854
S. B. Hegde, S. Satyappanavar, and S. Setty, “Restaurant setup business analysis using yelp dataset,” in 2017 IEEE International Conference on Advances in Computing, Communications and Informatics (ICACCI), 2017, doi: 10.1109/ICACCI.2017.8126196. DOI: https://doi.org/10.1109/ICACCI.2017.8126196
T. Doan and J. Kalita, “Sentiment Analysis of Restaurant Reviews on Yelp with Incremental Learning,” in 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA), 2016, doi: 10.1109/ICMLA.2016.0123. DOI: https://doi.org/10.1109/ICMLA.2016.0123
R. Shen, J. Shen, Y. Li, and H. Wang, “Predicting usefulness of Yelp reviews with localized linear regression models,” in 2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 2016, pp. 189–192, doi: 10.1109/ICSESS.2016.7883046. DOI: https://doi.org/10.1109/ICSESS.2016.7883046
S. S. Kumar et al., “Unveiling Patterns and Enhancing Recommendations: A Novel Regression Analysis Approach for Yelp Dataset,” in 2023 International Conference on Next Generation Electronics (NEleX), 2023, doi: 10.1109/NEleX59773.2023.10421042. DOI: https://doi.org/10.1109/NEleX59773.2023.10421042
N. Asghar, “Yelp dataset challenge: review rating prediction,” arXiv preprint, 2016, doi: 10.48550/arxiv.1605.05362.
E. Anenberg, C. Kuang, and E. Kung, “Social learning and local consumption amenities: evidence from yelp*,” J. Ind. Econ., vol. 70, no. 2, pp. 294–322, 2022, doi: 10.1111/joie.12291. DOI: https://doi.org/10.1111/joie.12291
M. Dolatabadi et al., “Cognitive sequential dependencies in the wild: sentiment analysis approach,” arXiv preprint, 2020, doi: 10.31234/osf.io/4mw8c. DOI: https://doi.org/10.31234/osf.io/4mw8c
Y. Shen et al., “Using social media to assess the consumer nutrition environment,” Public Health Nutr., vol. 22, no. 2, pp. 257–264, 2018, doi: 10.1017/s1368980018002872. DOI: https://doi.org/10.1017/S1368980018002872
M. Rahman, B. Carbunar, J. Ballesteros, and D. Chau, “To catch a fake: curbing deceptive yelp ratings and venues,” Stat. Anal. Data Min. ASA Data Sci. J., vol. 8, no. 3, pp. 147–161, 2015, doi: 10.1002/sam.11264. DOI: https://doi.org/10.1002/sam.11264
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Andrianshah Priyadi, Nelly Malik Lande, Anita Faradilla, Ma’arif Hasan, Evi Widianti

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
1. Copyright on any article is retained by the author(s).
2. The author grants the journal, right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work’s authorship and initial publication in this journal.
3. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal’s published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
4. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
5. The article and any associated published material is distributed under the Creative Commons Attribution-ShareAlike 4.0 International License