Skip to content

oem for real data analysis #16

Open
@LauraTurbatu

Description

@LauraTurbatu

I tryed using the oem package for variable selection for a quite large dataset, about 1 million observations and several variabales, around 60, both numerical and categorical. I used the sparse.model.matrix to transform the categorical to numerical, so I get some 1500 colons to the dataset. But then the X^TX is no longer invertible and so the oem function does not work. Can you recommend me what can I do when I want to do variable selection in a dataset with huge amount of lines and some categorical explanatory variables and also a few numerical ?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions