Data Science Asked by Tarun Pratap on December 24, 2020
In data pre-processing, stratified shuffle is used to ensure that the distribution of the original dataset is reflected in the training, test and validation dataset.
Mini-batch gradient descent uses random shuffling to ensure randomness in the mini-batches.
My doubt is- Why should we implement stratified shuffle on our dataset if it is going to be shuffled in a random manner later during training?
It doesn't, the workflow when training a model is like that:
If we skip the stratified shuffling in step 1 the classes of the train set, validation set and test set wont be evenly distributed.
If we skip the shuffling before each epoch in step 3 the mini-batches in each epoch will be the same.
The proportions of the train set, validation set and test set can of course vary.
Correct answer by Tim von Känel on December 24, 2020
3 Asked on August 12, 2021 by hail-caesar
beginner machine learning recommender system similarity supervised learning
0 Asked on August 12, 2021
2 Asked on August 12, 2021
loss function multilabel classification neural network prediction pytorch
0 Asked on August 11, 2021
2 Asked on August 11, 2021 by user105599
1 Asked on August 11, 2021
deep learning image classification inception keras transfer learning
3 Asked on August 11, 2021
decision trees machine learning python text mining unsupervised learning
2 Asked on August 10, 2021 by user3778289
1 Asked on August 10, 2021 by wacken0013
0 Asked on August 10, 2021 by benedict-k
cnn convolutional neural network image classification neural network
1 Asked on August 10, 2021
deep learning learning rate loss function optimization training
0 Asked on August 10, 2021 by srinivas
0 Asked on August 10, 2021
1 Asked on August 9, 2021 by rupert
0 Asked on August 9, 2021
feature selection multiclass classification python regression
1 Asked on August 9, 2021 by opyate
Get help from others!
Recent Questions
Recent Answers
© 2022 AnswerBun.com. All rights reserved. Sites we Love: PCI Database, MenuIva, UKBizDB, Menu Kuliner, Sharing RPP