TransWikia.com

Outlier detection for a univariate categorical variable?

Cross Validated Asked by Abdu on February 20, 2021

Does anyone know an outlier detection method for a univariate categorical (nominal, unordered) statistical variable? Without any assumptions about the categorical variable distribution (non-parametric method)?

3 Answers

As per my understanding, there is no concept of outliers detection in categorical variables(nominal), as each value is count as labels. Based on frequency(Mode), we can't do outliers treatment for categorical variables. Plz prove me wrong :)..

Answered by Kapil on February 20, 2021

Outliers are extreme values that we come across, where they may be influential to the model or not. When it comes to categorical data (say Gender: as in male and female). There's no way of any outlier detection in that. If you mean something like this: You take a sample of 10 with 9 males and 1 female. So you mean that "1 female" is an outlier? NO! It's just the composition of the sample which you have selected.

Answered by Dovini Jayasinghe on February 20, 2021

Think about your question once more because you ask for an algorithm to detect which of these is an outlier:

  • London
  • Munich
  • Paris
  • Barcelona

Nominal scale means that you have just labels of items like city names or car brands. You can't tell which is an outlier without additional info.

Answered by Silvestris on February 20, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP