Feature Engineering

Feature Engineering#

methods#

kernels#

The idea behind the kernel trick is to implicitly map data to a higher-dimensional space where it becomes linearly separable, enabling linear algorithms to solve nonlinear problems. This mapping is done through a kernel function, which computes the inner product between the images of two data points in this higher-dimensional space. Essentially, the kernel function measures the similarity between pairs of data points.

Feature Engineering Tools#

summary#

* continuous features
    * combination of features 
        * polynominal
        * normalization and standardization
        * change of coordinates
    * transforms:
        * kernels
        * STFT
        * wavelet
        * dictionaries

* discrete features
    * encoding (one-hot)
    * grouping (valid combinations)    

* missing value
    * dropping (features/ samples)
    * imputer by a model - nearest / interpolation / classifier / regressor

* data / time
    * day of the week/month/year

* text
    * steamming
    * lemmatization

* feature selction

* dimensionality reduction

* automatic feature generation (autoML)

* stemming vs lemmatiztion

TBD#

https://www.kaggle.com/code/willkoehrsen/automated-feature-engineering-tutorial https://www.kaggle.com/code/willkoehrsen/automated-feature-engineering-basics