Discovering Trending Topics from the Tweets By Odia News Media During Covid-19
Keywords:
Trend Analysis, Topic Modeling, Twitter, Covid-19Abstract
The onset of the Covid-19 pandemic and the lockdown imposed due to it has fueled the news consumption significantly. News portals including the ones in Odia language are actively feeding news related to Covid-19 to their consumers via their websites and Twitter handles. The news items didn't restrict to Covid-19 alone; they also touched a variety of domains of life like education, healthcare, administration, politics, movies, etc. Discovery of the news trends provides a bird’s eye view of the issues and topics that are popular in the online community. This could be of interest to advertisers, marketers, researchers, sociologists, and policymakers. This paper applies Topic Modeling to discover the trends from the tweets made by the Odia news media from 20th March 2020 to 31st August 2020, the period which saw the emergence of both lockdowns and unlocks in India. We found that during this period the Odia news media didn’t restrict themselves to report news surrounding Covid-19; rather they reported other happenings as well.
References
Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: Proceedings of the 19th international conference on World wide web. pp. 591–600 (2010).
Farhi, P.: The Twitter explosion: Whether they are reporting about it, finding sources on it or urging viewers, listeners, and readers to follow them on it, journalists just can’t seem to get enough of the social networking service. Just how effective is it as a journalism tool? American journalism review. 31, 26–32 (2009).
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. Journal of machine Learning research. 3, 993–1022 (2003).
Blei, D.M., Lafferty, J.D.: Dynamic topic models. In: Proceedings of the 23rd international conference on Machine learning. pp. 113–120. ACM (2006).
Wang, C., Blei, D., Heckerman, D.: Continuous time dynamic topic models. arXiv preprint arXiv:1206.3298. (2012).
Kawamae, N.: Trend analysis model: trend consists of temporal words, topics, and timestamps. In: Proceedings of the fourth ACM international conference on Web search and data mining. pp. 317–326 (2011).
Wang, X., McCallum, A.: Topics over time: a non-Markov continuous-time model of topical trends. In: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. pp. 424–433 (2006).
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. Journal of the American society for information science. 41, 391–407 (1990).
Papadimitriou, C.H., Raghavan, P., Tamaki, H., Vempala, S.: Latent semantic indexing: A probabilistic analysis. Journal of Computer and System Sciences. 61, 217–235 (2000).
Hofmann, T.: Probabilistic latent semantic indexing. In: ACM SIGIR Forum. pp. 211–218. ACM (2017).
Teh, Y.W., Jordan, M.I., Beal, M.J., Blei, D.M.: Sharing clusters among related groups: Hierarchical Dirichlet processes. In: Advances in neural information processing systems. pp. 1385–1392 (2005).
Surian, D., Nguyen, D.Q., Kennedy, G., Johnson, M., Coiera, E., Dunn, A.G.: Characterizing Twitter discussions about HPV vaccines using topic modeling and community detection. Journal of medical Internet research. 18, (2016).
Ghosh, D., Guha, R.: What are we ‘tweeting’about obesity? Mapping tweets with topic modeling and Geographic Information System. Cartography and geographic information science. 40, 90–102 (2013).
Alvarez-Melis, D., Saveski, M.: Topic modeling in twitter: Aggregating tweets by conversations. In: Tenth international AAAI conference on web and social media (2016).
Lu, R., Yang, Q.: Trend analysis of news topics on twitter. International Journal of Machine Learning and Computing. 2, 327 (2012).
Mathioudakis, M., Koudas, N.: Twittermonitor: trend detection over the twitter stream. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data. pp. 1155–1158 (2010).
Lau, J.H., Collier, N., Baldwin, T.: On-line trend analysis with topic models:# twitter trends detection topic model online. In: Proceedings of COLING 2012. pp. 1519–1534 (2012).
Sha, H., Hasan, M.A., Mohler, G., Brantingham, P.J.: Dynamic topic modeling of the COVID-19 Twitter narrative among US governors and cabinet executives. arXiv preprint arXiv:2004.11692. (2020).
De Santis, E., Martino, A., Rizzi, A.: An Infoveillance System for Detecting and Tracking Relevant Topics From Italian Tweets During the COVID-19 Event. IEEE Access. 8, 132527–132538 (2020).
Singh, L., Bansal, S., Bode, L., Budak, C., Chi, G., Kawintiranon, K., Padden, C., Vanarsdall, R., Vraga, E., Wang, Y.: A first look at COVID-19 information and misinformation sharing on Twitter. arXiv preprint arXiv:2003.13907. (2020).
Lwin, M.O., Lu, J., Sheldenkar, A., Schulz, P.J., Shin, W., Gupta, R., Yang, Y.: Global sentiments surrounding the COVID-19 pandemic on Twitter: analysis of Twitter trends. JMIR public health and surveillance. 6, e19447 (2020).
Ordun, C., Purushotham, S., Raff, E.: Exploratory analysis of covid-19 tweets using topic modeling, umap, and digraphs. arXiv preprint arXiv:2005.03082. (2020).
Kabir, M., Madria, S., others: CoronaVis: A Real-time COVID-19 Tweets Analyzer. arXiv preprint arXiv:2004.13932. (2020).
Chaupattnaik, S., Nanda, S.S., Mohanty, S.: A suffix stripping algorithm for Odia stemmer. International Journal of Computational Linguistics and Natural Language Processing. 1, 1–5 (2012).
Sethi, D.P.: Design of lightweight stemmer for Odia derivational suffixes. Int. Journal of Advanced Research in Computer and Communication Engineering. 2, (2013).
Padhy, H., Mohanty, S.: Designing hybrid approach spell checker for Oriya. Int. J. Latest Trends Eng. Technol. 2, 156–160 (2013).
Jena, I., Chaudhury, S., Chaudhry, H., Sharma, D.M.: Developing Oriya morphological analyzer using Lt-toolbox. In: Information Systems for Indian Languages. pp. 124–129. Springer (2011).
Balabantaray, R., Jena, M., Mohanty, S.: Shallow morphology based complex predicates extraction in Oriya. International Journal of Computer Applications. 975, 8887 (2011).
Jena, I., Chaudhry, H., Sharma, D.M.: Oriya Morphological Analyzer Using Lttoolbox. In: International Symposium on Languages, Applications and Technologies. pp. 15–25. Springer (2015).
Mohapatra, R., Hembram, L.: Morph-Synthesizer for Oriya Language-A Computational Approach. Language In India. 10, (2010).
Balabantaray, R., Lenka, S., Sahoo, D.: Name Entity Recognizer for Odia using Conditional Random Fields. Indian Journal of Science and Technology. 6, 4290–4293 (2013).
Biswas, S., Mohanty, S., Mishra, S.: A hybrid Oriya named entity recognition system: integrating HMM with MaxEnt. In: 2009 Second International Conference on Emerging Trends in Engineering & Technology. pp. 639–643. IEEE (2009).
Jena, M.K., Mohanty, S.: Predicting Sensitivity of Local News Articles from Odia Dailies. In: International Conference on Biologically Inspired Techniques in Many-Criteria Decision Making. pp. 144–151. Springer (2019).
Jena, M.K., Mohanty, S.: Predicting Impact of Odia Newspaper Articles on Public Opinion. In: Progress in Computing, Analytics and Networking. pp. 265–272. Springer (2020).
Mohanty, G., Mishra, P., Mamidi, R.: Annotated Corpus for Sentiment Analysis in Odia Language. In: Proceedings of The 12th Language Resources and Evaluation Conference. pp. 2788–2795 (2020).
Chuang, J., Manning, C.D., Heer, J.: Termite: Visualization techniques for assessing textual topic models. In: Proceedings of the international working conference on advanced visual interfaces. pp. 74–77. ACM (2012).
Newman, D., Lau, J.H., Grieser, K., Baldwin, T.: Automatic evaluation of topic coherence. In: Human language technologies: The 2010 annual conference of the North American chapter of the association for computational linguistics. pp. 100–108 (2010).
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2020 International Journal of Machine Learning and Networked Collaborative Engineering
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
https://creativecommons.org/licenses/by/4.0/legalcode