Pairing lovers story – Machine Learning Stories

Your partner (for the fluent writings let’s say it’s a woman), the psychology student, got the grant for the research about the relationships. She created a pretty big collection of 12933 questionnaires describing the personalities of the people being in a long-term relationship, and pairs of the people from the broken relationships.

Because your girlfriend knows you’re working with data modeling, she has a small job after hours – let’s create a classifier that will help her to promote the research. Do the two people match?

At the beginning of the project:

	Spend a few days checking what is a human-level performance for this task so that you can define your target
	Spend a few days training a basic model and see what mistakes it makes.
	Spend a few days getting more data because usually, you need a lot bigger datasets than 13k examples

Correct

Incorrect

You trained a very first model. Errors (100%-Accuracy) are:

Training set	14.0%
Dev set	15.5%

Which sentence do you agree with?

	14.0% of training errors shows you have a high bias
	You have a higher bias than a variance
	Probably you have a higher variance than a bias
	None of the above

Correct

Incorrect

You ask your girlfriend if there’s something you can define as “human-level performance.” She says that she can try to classify the pairs questionnaires on the knowledge and intuition she has. How many examples should she classify?

	The entire dev set
	The entire test set
	The entire train set
	Sampled examples from the train set

Correct

Incorrect

Based on your girlfriend's score, and still working on the model, you have:

Human-level performance	7.0%
Training set	12.0%
Dev set	12.5%

Which two of the following options are the most promising?

	Get a bigger training set to reduce variance
	Train a bigger model to try to do better on the training set.
	Try increasing regularization.
	Try decreasing regularization. It ok to try it

Correct

Incorrect

You also evaluate your model on the test set, and find the following:

Human-level performance	7.0%
Training set	12.0%
Dev set	12.5%
Test set	20.5%

	You should try to get a bigger dev set.
	You are overfitting to the dev set.
	You should get a bigger test set.
	You are overfitting to the test set.

Correct

Incorrect

Your friend has called you on a telephone and he wants to take the advice from you. Surprisingly he’s having a very similar problem. His results on some dataset are as follows:

Human-level performance	1.0%
Training set	1.2%
Dev set	1.2%
Test set	0.8%

What’s the best advice for him?

	You should try to get a bigger dev set.
	You are overfitting to the dev set.
	You should get a bigger test set.
	You are overfitting to the test set.

Correct

Incorrect

Your girlfriend found a similar experiment on the internet, but only some of the open questions are the same. You want to use the data. Where can it be added?

	To the training set
	To the development set
	To the test set

Correct

Incorrect

After further work and adding the new data to the train set, and creating new, bigger splits, you’re getting the following results:

Human-level performance	7.0%
Training set	7.1%
Dev set	12.4%
Test set	12.5%

	Your algorithm overfits the dev set because the error of the dev and test sets are very similar.
	You have a large variance problem because your training error is quite higher than the human-level error.
	You have a large avoidable-bias problem
	You have a large data-mismatch problem

Correct

Incorrect

Based on the table from the previous question, your girlfriend thinks that the
training dataset is easier than the dev and test sets. Do you agree?

	The algorithm does better on the distribution of data it trained on, so it’s normal behavior, she’s not right.
	If the score is better on the train data it has to be easier indeed.
	You don’t know if it’s because the algorithm is trained on train set or if it really is easier.

Correct

Incorrect

After working further on the problem, you’ve noticed that some questionnaires have a lot of empty or almost empty answers on the test set. You think that you should delete these questionnaires form the test set. Should you also delete incorrectly filled questionnaires from the dev or train set?

	You should delete incorrectly filled questionnaires in the training set as well so as to avoid your training set now being more different from your dev set.
	You should delete incorrectly filled questionnaires in the dev set as well so as to make sure that your dev and test data come from the same distribution

Correct

Incorrect

You’ve filled the questionnaire together with your girlfriend and you have the final score: unfortunately, you don’t fit together, having the 28% probability of fitting. You and your girlfriend feel different, though. What does it mean?

	Your test error is 7%, so probably you are one of the 7% mistakes.
	The chance for your relationship to last is very small, so you should break your relationship

Correct

Incorrect

Leave a Comment Cancel reply