What is next?

The discussion questions below can help consolidate your learning. How many can you answer?

What is the “right” way to split data for machine learning?
- Would that work for time-series data?
- What if you had parameters you wanted to learn?
- What if you wanted to compare models?
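As a starting point for the time-series question, here is a minimal sketch contrasting a shuffled split with a chronological one. The function names `shuffled_split` and `chronological_split` and the toy data are hypothetical, chosen only for illustration. The point to look for is that a shuffled split can train on observations that come after some of the test observations, while a chronological split cannot.

```python
import random

def shuffled_split(records, test_fraction=0.25, seed=0):
    """Randomly assign records to train and test, ignoring time order."""
    rng = random.Random(seed)
    shuffled = records[:]          # copy so the caller's list is untouched
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

def chronological_split(records, test_fraction=0.25):
    """Train on the earliest records and test on the most recent ones."""
    ordered = sorted(records, key=lambda r: r[0])   # sort by time step
    n_test = int(len(ordered) * test_fraction)
    cut = len(ordered) - n_test
    return ordered[:cut], ordered[cut:]

# Toy series of (time_step, value) pairs; the values are made up.
series = [(t, 10 + 2 * t) for t in range(12)]

train, test = chronological_split(series)
print("latest training time:", max(t for t, _ in train))
print("earliest test time:  ", min(t for t, _ in test))
```

Whether a chronological split is the "right" one still depends on the question being asked, so treat this as one option to discuss rather than a rule.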
If you had just one number to score a model, which would you choose?
- When is it good?
- When is it bad?
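To make the single-score question concrete, here is a small sketch with made-up labels (5 positives out of 100) and a hypothetical "model" that always predicts the negative class. The helper functions `accuracy` and `recall` are written out by hand for illustration and are not taken from any particular library.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def recall(y_true, y_pred, positive=1):
    """Fraction of actual positives that were predicted as positive."""
    true_pos = sum(t == positive and p == positive
                   for t, p in zip(y_true, y_pred))
    actual_pos = sum(t == positive for t in y_true)
    return true_pos / actual_pos if actual_pos else float("nan")

# Made-up, heavily imbalanced labels: 5 positives, 95 negatives.
y_true = [1] * 5 + [0] * 95
# A "model" that ignores its input and always predicts the majority class.
y_pred = [0] * 100

print("accuracy:", accuracy(y_true, y_pred))
print("recall:  ", recall(y_true, y_pred))
```

Here accuracy looks high while recall is zero, which is one situation in which a single number hides what the model is actually doing.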
How do you find out more information about a model?
If you ran a model and got an answer, and a colleague independently ran an analysis and came to a different conclusion, how would you go about explaining the discrepancy?
What makes you think that the analysis is “right”? How would you know if something was wrong?
Did you test a hypothesis? If so, which hypothesis?
- Is there an area of your research where this is applicable?
- What about related work on slightly different problems? Are the hypotheses the same?
How would you handle missing data?