Research based on social media data can contain hidden biases that ‘misrepresent real world,’ critics say
2014/12/05 Leave a comment
Good article on some of the limits in using social media for research, as compared to IRL (In Real Life):
One is ensuring a representative sample, a problem that is sometimes, but not always, solved by ever greater numbers. Another is that few studies try to “disentangle the human from the platform,” to distinguish the user’s motives from what the media are enabling and encouraging him to do.
Another is that data can be distorted by processes not designed primarily for research. Google, for example, stores only the search terms used after auto-completion, not the text the user actually typed. Another is simply that many social media are largely populated by non-human robots, which mimic the behaviour of real people.
Even the cultural preference in academia for “positive results” can conceal the prevalence of null findings, the authors write.
“The biases and issues highlighted above will not affect all research in the same way,” the authors write. “[But] they share in common the need for increased awareness of what is actually being analyzed when working with social media data.”