I think it would be useful to create a model that tries to predict whether a youtube comment is paedophilic – maybe the model should also take into account the channel name/description/front image.
It’s not an easy task but at the moment I’m just looking for data.
I know it’s a sensitive topic – but does anyone know of a dataset out there with the characteristics I need?
Sexually Abusive Comments and specific words collection from popular youtube videos such as music videos and cartoons (Peppa Pig)
Trending YouTube Video Statistics and Comments Daily statistics (views, likes, category, comments+) for trending YouTube videos
The dataset includes data gathered from videos on YouTube that are contained within the trending category each day.
There are two kinds of data files, one includes comments and one includes video statistics. They are linked by the unique video_id field.
If you are interested to generate your own dataset the below article might be helpful.
Answered by Pluviophile on November 13, 2021
1 Asked on September 29, 2021 by alotropico
2 Asked on September 29, 2021 by e-k
1 Asked on September 29, 2021
2 Asked on September 29, 2021 by upquark
0 Asked on January 31, 2021 by benten
0 Asked on January 1, 2021 by cheesus
0 Asked on December 20, 2020 by botnzd
0 Asked on December 2, 2020 by folderj
1 Asked on October 22, 2020 by manius
Get help from others!