Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Midterm Report Review from jx255 #6

Open
ANGELA6XU opened this issue Nov 2, 2016 · 0 comments
Open

Midterm Report Review from jx255 #6

ANGELA6XU opened this issue Nov 2, 2016 · 0 comments

Comments

@ANGELA6XU
Copy link

In general, the structure of this report is straightforward and concrete. It shows a clear process from data cleaning, feature selections, model selections and future development. The histograms and descriptive statistics can help illustrate and determine the feature selection well.

For cleaning the original data set, I have noticed that around 10k listings have been deleted. So, what concerns to me is that will this deletion method for dealing with missing/outliers cause information loss? It would be great if you would try other methods for handling the missing data, such as Mean/Mode completer or k-means clustering.

Besides the ridge regression, it would be great if you could try different other models and describe the reasons for model selection and interpretations in more details.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant