If An Artificial Intelligence Read 2.5 Million News Articles, What Would It Learn?

Artificial Intelligence (AI) algorithms recently parsed 2.5 million articles from 498 different English-language (online) news outlets over a period of ten months and created data about what was contained.  Could AI qualitatively give people more interesting news? The results showed what you likely knew - online tabloid newspapers are more readable than broadsheets and use more sentimental language. Among 15 U.S. and U.K. newspapers, The Sun was the 'easiest' to read - comparable to the BBC's children's news program, Newsround - while they found the The Guardian was the most difficult to read.

Artificial Intelligence (AI) algorithms recently parsed 2.5 million articles from 498 different English-language (online) news outlets over a period of ten months and created data about what was contained.  Could AI qualitatively give people more interesting news?

The results showed what you likely knew - online tabloid newspapers are more readable than broadsheets and use more sentimental language. Among 15 U.S. and U.K. newspapers, The Sun was the 'easiest' to read - comparable to the BBC's children's news program, Newsround - while they found the The Guardian was the most difficult to read.

'Sport' and 'Arts' topics were the most readable topics while 'Politics' and 'Environment' were the least and sports showed a predictable male bias.  Sports news mentioned men eight times more often than women while 'Fashion' and 'Arts' were the least biased - 'Fashion' articles mentioned equal proportions of men and women. 

The Sun was also the most likely to use sentimental adjectives while the Wall Street Journal used the fewest emotional adjectives. 

The most appealing topics to online readers were 'Disasters', 'Crime', and the 'Environment' while the least appealing topics were 'Fashion', 'Markets' and 'Prices'. The researchers also found that the popular articles tend to be more readable and more linguistically subjective.

Obviously not all content is put online so that skewed the data set.  Online papers put the most popular stuff on the WWW.

Nello Cristianini, Professor of Artificial Intelligence at the University of Bristol, said about the research, "The automation of many tasks in news content analysis will not replace the human judgement needed for fine-grained, qualitative forms of analysis, but it allows researchers to focus their attention on a scale far beyond the sample sizes of traditional forms of content analysis."

Professor Justin Lewis, Head of the School of Journalism, Media and Cultural Studies at Cardiff, found the results a way to invoke gender bias - "Even some of the more predictable findings give us pause for thought. The extent to which news is male dominated shows how far we are from gender equity across most areas of public life. The fact that articles about politics are the least readable might also explain widespread public disengagement."

The results were published online in Digital Journalism.

Old NID
97094
Categories

Latest reads

Article teaser image
Donald Trump does not have the power to rescind either constitutional amendments or federal laws by mere executive order, no matter how strongly he might wish otherwise. No president of the United…
Article teaser image
The Biden administration recently issued a new report showing causal links between alcohol and cancer, and it's about time. The link has been long-known, but alcohol carcinogenic properties have been…
Article teaser image
In British Iron Age society, land was inherited through the female line and husbands moved to live with the wife’s community. Strong women like Margaret Thatcher resulted.That was inferred due to DNA…