Uses the KS test D-statistic as feature importance score. Captures differences in distribution shape, location, and zero-frequency between two classes without assuming a parametric distribution.
Details
The KS test compares Empirical Cumulative Distribution Functions (ECDF). If one class has 80 a massive difference captured by the D statistic (bounded [0, 1]).
Higher D values indicate better class separation.
Dictionary
This Filter can be instantiated via the dictionary `mlr_filters` or with the associated sugar function `flt()`: “` flt("gof_ks") mlr_filters$get("gof_ks") “`
Super class
mlr3filters::Filter -> FilterGOF_KS