Repository logo
  • English
  • Deutsch
  • Español
  • Français
  • Log In
    New user? Click here to register.Have you forgotten your password?
Repository logo
  • Communities & Collections
  • Research Outputs
  • Fundings & Projects
  • People
  • Statistics
  • English
  • Deutsch
  • Español
  • Français
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. CRIS
  3. Publication
  4. Beyond Quality: Predicting Citation Impact in Business Research Using Data Science
 
  • Details
Options

Beyond Quality: Predicting Citation Impact in Business Research Using Data Science

Journal
Publications
ISSN
2304-6775
Date Issued
2025-09-05
Author(s)
PEREZ CAMPDESUÑER, REYNER FRANCISCO  
Facultad de Derecho, Ciencias Administrativas y Sociales  
SANCHEZ RODRIGUEZ, ALEXANDER  
Facultad de Ciencias de la Ingeniería e Industrias  
MARTÍNEZ VIVAR, RODOBALDO  
Facultad de Derecho, Ciencias Administrativas y Sociales  
Margarita De Miguel-Guzmán
GARCIA VIDAL, GELMAR  
Facultad de Derecho, Ciencias Administrativas y Sociales  
DOI
10.3390/publications13030042
Abstract
The volume of scientific publications has increased exponentially over the past decades across virtually all academic disciplines. In this landscape of information overload, objective criteria are needed to identify high-impact research.

Citation counts have traditionally served as a primary indicator of scientific relevance; however, questions remain as to whether they truly reflect the intrinsic quality of a publication. This study investigates the relationship between citation frequency and a wide range of editorial, authorship, and contextual variables. A dataset of 339,609 articles indexed in Scopus was analyzed, retrieved using the search query TITLE-ABS-KEY (management) AND LIMIT-TO (subarea, “Busi”).

The research employed a descriptive analysis followed by two predictive modeling approaches: a Random Forest algorithm to assess variable importance, and a binary logistic regression to estimate the probability of a paper being cited. Results indicate that factors such as journal quartile, country of affiliation, number of authors, open access availability, and keyword usage significantly influence citation outcomes.

The Random Forest model explained 94.9% of the variance, while the logistic model achieved an AUC of 0.669, allowing the formulation of a predictive citation equation.

Findings suggest that multiple determinants beyond content quality drive citation behavior, and that citation probability can be predicted with reasonable accuracy, though inherent model limitations must be acknowledged.
Subjects

bibliometric studies

business and manageme...

citations

random forest models

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback

Hosting & Support by

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science