TransWikia.com

How to tune parameters batch by batch?

Data Science Asked by yayaya on January 6, 2021

As the title states, I am trying to cluster a huge dataset and cluster it by using sklearn.Birch to learn incrementally.

If it’s a small dataset, I could just use gridsearchcv.

However, there’s no built-in way to do that with large dataset in scikit-learn right now.

I’m curious if there is any suitable/general way to tune parameters batch by batch?

One Answer

In Grid Search, Random Search, we trained different independent model i.e. irrespective of the outcome of previous models and later evaluate all these independent models.

We have another automatic technique for hyper-parameter optimization, known as Bayesian HyperOpt.

It took the references of previous model and try to identify the best new set of hyper param and trained a new model. This two step process gets repeated, until we get the desired results.

You can get many online references for complete implementation.

Although I could not find any approach for batch by batch, I thought its worth to mention Bayesian approach.

Answered by vipin bansal on January 6, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP