
Hey, one easy way to pick features is to use a correlation matrix. The closer the correlation coefficient r is to 1 or -1, the stronger the linear relationship between the two variables.

The code goes like this:

  install.packages('corrplot')
  library(corrplot)

  mcor <- cor(crime)  # if crime is your data frame (numeric columns only)
  corrplot(mcor)
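If you want a shortlist rather than just a plot, here is a rough sketch of ranking features by |r| against a target column. I'm using the built-in mtcars data and "mpg" as a stand-in target, since I don't know what your crime data frame looks like; both are assumptions, so swap in your own names.

```r
# Sketch: rank candidate features by |r| against a target column.
# mtcars and the target "mpg" are stand-ins, not the original crime data.
df <- mtcars

# cor() only accepts numeric columns, so keep just those
num <- df[, sapply(df, is.numeric), drop = FALSE]

# pairwise.complete.obs drops NAs pair by pair instead of returning NA
mcor <- cor(num, use = "pairwise.complete.obs")

# pull the target's row and drop its correlation with itself
target <- "mpg"
r <- mcor[target, colnames(mcor) != target]

# strongest linear relationships first, sign preserved
ranked <- r[order(abs(r), decreasing = TRUE)]
print(round(ranked, 2))
```

Keep in mind r only picks up linear relationships, and two features that correlate strongly with each other are probably carrying much of the same information.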

That's one easy way to start out. Perhaps I'll write a post on feature engineering.




Thank you, I will try this. And I would love to see such a post.



