Statistical inference from power law distributed web-based social interactions

Daphne R. Raban, Eyal Rabin

Research output: Contribution to journalArticlepeer-review

Abstract

Purpose The purpose of this paper is to propose a method for statistical inference on data from power law distributions in order to explain behavior and social phenomena associated with web-based social spaces such as discussion forums, question-and-answer sites, web 2.0 applications and the like. Design/methodology/approach The paper starts by highlighting the importance of explaining behavior in social networks. Next, the power law nature of social interactions is described and a hypothetical example is used to explain why analyzing sub-sets of data might misrepresent the relationship between variables having power law distributions. Analysis requires the use of the complete distribution. The paper proposes logarithmic transformation prior to correlation and regression analysis and shows why it works using the hypothetical example and field data retrieved from Microsoft's Netscan project. Findings The hypothetical example emphasizes the importance of analyzing complete datasets harvested from social spaces. The Netscan example shows the importance of the logarithmic transformation for enabling the development of a predictive regression model based on the power law distributed data. Specifically, it shows that the number of new and returning participants are the main predictors of discussion forum activity. Originality/value This paper offers a useful analysis tool for anyone interested in social aspects of the Internet as well as corporate intra-net systems, knowledge management systems or other systems that support social interaction such as cellular phones and mobile devices. It also explains how to avoid errors by paying attention to assumptions and range restriction issues.

Original languageEnglish
Pages (from-to)266-278
Number of pages13
JournalInternet Research
Volume19
Issue number3
DOIs
StatePublished - 5 Jun 2009

Keywords

  • Internet
  • Knowledge sharing
  • Polynomials
  • Social interaction
  • Statistical analysis

ASJC Scopus subject areas

  • Communication
  • Sociology and Political Science
  • Economics and Econometrics

Fingerprint

Dive into the research topics of 'Statistical inference from power law distributed web-based social interactions'. Together they form a unique fingerprint.

Cite this