I’ll be delving into text mining applications for my master thesis and I need data for it. Ideally, I would need a corpora of texts/news articles from some single (or multiple) credible and authoritative source covering financial markets/economy and the like, spanning a time period as long as possible.
Does anyone know of datasets/corporas that would suit my requirements?
Any other possibilities/ideas to construct one such dataset using open resources?
I know this question has been already asked but the answers that were posted are not longer valid.
Reuters Financial Dataset is a large collection of Financial News Article scraped from Reuters website. Originally used for the paper Using Structured Events to Predict Stock Price Movement:An Empirical Investigation - Ding et al.(2014) this set of unstructured data is a powerful warehouse of historic Financial Data. This script provides a way of arranging the huge corpus of information into a Pandas' efficient data structure DataFrame
Originally, this repository consisted of badly written Python script which was monolitic and cryptic. This refactor breaks the code down into smaller functions and comes equipped with a function to create the DataFrame.
Answered by Pluviophile on November 20, 2021
7 Asked on January 4, 2022
1 Asked on January 4, 2022 by k-ghazal
1 Asked on January 2, 2022 by maksim-grinman
0 Asked on December 17, 2021 by stan-rhodes
0 Asked on December 2, 2021 by shishir-kumar
1 Asked on November 22, 2021 by tav
1 Asked on November 22, 2021
0 Asked on November 22, 2021 by mikey-wood
1 Asked on November 20, 2021
1 Asked on November 13, 2021 by guillermo-mosse
1 Asked on November 10, 2021 by ksona
1 Asked on September 30, 2021 by mark-jones
2 Asked on September 30, 2021 by patrick-lamatiere
Get help from others!