Artificial Intelligence Asked by Andreas Toresäter on September 20, 2020
I just started working with the GPT-2 models and want to retrain one on a pretty narrow topic, so I have problems finding training material.
How large should the corpus be to optimally retrain the GPT-2 model? And what is the bare minimum size? Should it simply be as large as possible or can it flip over and make the model worse in some way?
I am also not certain how many steps you should let the retraining run. I have been using 6000 steps when testing, and it seems not much happens after that, loss only moved from 0.2 to 0.18 last 1000 steps.
1 Asked on November 24, 2021
applications deep learning deepfakes generative adversarial networks
1 Asked on November 20, 2021
autoencoders deep learning machine learning neural networks unsupervised learning
1 Asked on November 20, 2021
1 Asked on November 17, 2021 by dhanush-giriyan
1 Asked on November 12, 2021
1 Asked on November 10, 2021
long short term memory machine learning open ai reinforcement learning time series
1 Asked on November 7, 2021
2 Asked on November 4, 2021
deep rl dqn neural networks reinforcement learning temporal difference methods
1 Asked on November 4, 2021
dense rewards reinforcement learning reward design reward functions reward shaping
0 Asked on November 4, 2021 by tinu
ai development machine learning papers research state of the art
1 Asked on November 4, 2021 by ijuneja
1 Asked on August 24, 2021 by kashan
1 Asked on August 24, 2021 by ram-bharadwaj
1 Asked on August 24, 2021 by metrician
epsilon greedy policy monte carlo methods notation on policy methods reinforcement learning
1 Asked on August 24, 2021 by user289602
1 Asked on August 24, 2021 by daniel-koh
0 Asked on August 24, 2021 by soitgoes
function approximation markov decision process reinforcement learning
0 Asked on August 24, 2021 by seunosiko
0 Asked on August 24, 2021 by user38639
convolutional neural networks data augmentation neural networks testing training
Get help from others!
Recent Answers
© 2022 AnswerBun.com. All rights reserved. Sites we Love: PCI Database, MenuIva, UKBizDB, Menu Kuliner, Sharing RPP, SolveDir