Student Performance Evaluation Analysis

by Olusola Timothy Ogundepo

Introduction

PISA is a survey of students' skills and knowledge as they approach the end of compulsory education. It is not a conventional school test. Rather than examining how well students have learned the school curriculum, it looks at how well prepared they are for life beyond school. Around 510,000 students in 65 economies took part in the PISA 2012 assessment of reading, mathematics and science, representing about 28 million 15-year-olds globally.

Preliminary Wrangling

What is the structure of your dataset?

The PISA dataset contains over 400,000 student responses with more than 600 features. For this analysis and visualization, I will use a sample of 200,000 responses with 29 features. The features come in a variety of formats, such as nominal, ordinal, discrete, continuous, and text.

What is/are the main feature(s) of interest in your dataset?

What features in the dataset do you think will help support your investigation into your feature(s) of interest?

Rename all columns to their appropriate names

After renaming all columns of the dataset

Identify and Fix issues

Data type of all features

Student id is integer instead of object

Checking the unique items of:

Days skipped in a week

Some values have trailing spaces that need to be removed, and NaN should be replaced with None for consistency.
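The cleanup described above could be sketched as follows. This is a minimal illustration, not the notebook's original code: the column name `days_skipped`, the sample values, and the helper `clean_text_column` are hypothetical, and it assumes "None" is meant as the string label for missing answers.

```python
import numpy as np
import pandas as pd

# Hypothetical sample of a "days skipped" column with trailing spaces and NaN.
df = pd.DataFrame({
    "days_skipped": ["None  ", "One or two times  ", np.nan, "Five or more times  "]
})

def clean_text_column(series: pd.Series, missing_label: str = "None") -> pd.Series:
    """Strip surrounding whitespace, then replace missing values with a label."""
    return series.str.strip().fillna(missing_label)

df["days_skipped"] = clean_text_column(df["days_skipped"])
print(df["days_skipped"].unique())
```

The same helper would apply to any other text column with stray whitespace, such as the country column.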

Number of classes skipped in a week

Country

Some country names have leading or trailing spaces.

Checking for duplicates in the dataset

The data has no duplicates.
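A duplicate check of this kind can be written in a couple of lines with pandas. The frame below is a hypothetical stand-in for the real data, and the column names are assumptions.

```python
import pandas as pd

# Hypothetical stand-in for the survey data; every row is distinct.
df = pd.DataFrame({
    "student_id": ["1", "2", "3"],
    "country": ["Spain", "Canada", "Brazil"],
})

# duplicated() flags rows that repeat an earlier row across all columns.
n_duplicates = int(df.duplicated().sum())
print(f"Duplicate rows: {n_duplicates}")

# Dropping duplicates is a no-op here, but keeps the step safe to rerun.
df = df.drop_duplicates().reset_index(drop=True)
```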

Parent school level

The parent school level columns have trailing spaces, and most school levels are wrapped in < and >. The trailing spaces should be removed and the values structured consistently, e.g.,
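One possible cleanup is sketched below. It assumes the goal is simply to drop the angle brackets and the surrounding whitespace; the target format intended in the original is not shown here, and the raw values and the helper `clean_school_level` are hypothetical.

```python
import pandas as pd

# Hypothetical raw values; the real labels in the dataset may differ.
raw = pd.Series([
    "<ISCED level 3A> ",
    "<ISCED level 2> ",
    "She did not complete <ISCED level 1> ",
])

def clean_school_level(series: pd.Series) -> pd.Series:
    """Remove angle brackets anywhere in the value, then strip whitespace."""
    return series.str.replace(r"[<>]", "", regex=True).str.strip()

cleaned = clean_school_level(raw)
print(cleaned.tolist())
```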

Function to order categorical columns

Parent school levels should be ordered
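A helper for ordering categorical columns might look like the sketch below. The function name `to_ordered_category`, the column names, and the level labels are assumptions made for illustration; the original function is not shown.

```python
import pandas as pd

def to_ordered_category(df: pd.DataFrame, columns: list, order: list) -> pd.DataFrame:
    """Convert the given columns to an ordered categorical dtype (lowest first)."""
    dtype = pd.CategoricalDtype(categories=order, ordered=True)
    for col in columns:
        df[col] = df[col].astype(dtype)
    return df

# Hypothetical columns and level labels.
df = pd.DataFrame({
    "mother_school_level": ["ISCED level 2", "ISCED level 1", "ISCED level 3A"],
    "father_school_level": ["ISCED level 3A", "ISCED level 2", "ISCED level 2"],
})
levels = ["ISCED level 1", "ISCED level 2", "ISCED level 3A"]
df = to_ordered_category(df, ["mother_school_level", "father_school_level"], levels)

# Ordered categories support min/max and comparisons against a level.
print(df["mother_school_level"].min())
print((df["mother_school_level"] >= "ISCED level 2").tolist())
```

With an ordered dtype, sorting and plotting follow the declared level order rather than alphabetical order.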

Parent Job Status

Job status has some trailing spaces that need to be removed, and it should be an ordinal category.

Primary education

Some values have trailing spaces that need to be removed.

It would also help if students' primary school experience were an ordinal category.

Possessions

To reduce memory usage and impose an internal order, it is best to convert these columns to the category type with the order ['No', 'Yes'].
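The memory effect of this conversion can be checked directly. The sketch below uses a hypothetical `has_computer` column with made-up values, and compares the footprint before and after the conversion.

```python
import pandas as pd

# Hypothetical possession column with 100,000 Yes/No answers.
possession = pd.Series(["Yes", "No", "Yes", "Yes"] * 25_000, name="has_computer")

# Ordered category with 'No' below 'Yes'.
yes_no = pd.CategoricalDtype(categories=["No", "Yes"], ordered=True)
as_category = possession.astype(yes_no)

# deep=True counts the string payloads, not just the pointers.
text_bytes = possession.memory_usage(deep=True)
category_bytes = as_category.memory_usage(deep=True)
print(f"text: {text_bytes:,} bytes, category: {category_bytes:,} bytes")
```

A categorical column stores one small integer code per row plus a single copy of each label, which is why it is much smaller for columns with few distinct values.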

After changing the type

Gender

It is reasonable to place both math interest and math anxiety in the following order:

After the ordering has been corrected

Student failure attribute

It would be logical if student failure attributes were placed in an ordinal category.

Summary analysis of the dataset

Saving the processed data

Univariate Exploration

This section investigates the distributions of individual variables.

Most common country and gender

Which country has the most students who filled out the survey, and which gender is most represented?

Most of the students who filled out the survey come from Spain, Canada, and Brazil, and female students form the larger group. Did female students perform better than male students? Which country has the highest student performance in examinations?
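Counts like these can be obtained with `value_counts`. The sketch below uses a tiny made-up frame, so the numbers are illustrative only and the column names are assumptions.

```python
import pandas as pd

# Toy data for illustration; not the survey's actual counts.
df = pd.DataFrame({
    "country": ["Spain", "Spain", "Canada", "Brazil", "Spain", "Canada"],
    "gender": ["Female", "Male", "Female", "Female", "Male", "Female"],
})

# Countries ranked by number of respondents.
top_countries = df["country"].value_counts().head(3)

# Share of respondents per gender.
gender_share = df["gender"].value_counts(normalize=True)

print(top_countries)
print(gender_share)
```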

Students Primary Education

Do students have a primary education when they are young? How many years did it take to finish their primary education? What was the age of the majority of students then?

The majority of students attended primary school for more than one year. Does primary education boost students' scores in international examinations?

Most students were between 4 and 8 years old during their primary education, with fewer students older than that. Does early education influence student performance?

Possession

Investigate the basic possessions of students. Did students possess a room, a computer, the internet, and a textbook to prepare for their education?

In this dataset, many students possess a room and a computer, but some do not: 30,000 students didn't have a room, 18,800 didn't have a place to study, and 18,100 didn't have a computer for research and learning. How will these students perform in their examinations?

A high number of students possessed both the internet and textbooks, but quite a few didn't. Will students without textbooks to read or the internet to learn pass their examinations?

Parent Education Level & Earnings

Investigate the educational level of the parents. What are the most common levels of educational achievement for both parents?

The distributions of both parents' educational levels are right-skewed, and approximately half of the students in the study have a parent with some college education. How well did students whose parents have a degree perform compared to those whose parents have none? Does parent education influence student performance?

Parent Job Status

Since approximately half of the students' parents have some college education, are they employed full-time or part-time?

A high number of students' parents are employed full-time, with a small number of retirees and part-time workers. Do students with a parent working full-time perform better than those without?

Distribution of Parent Earning (Pct)

The distribution of parent earnings (Pct) is fairly spread out for both parents, with approximate means of 44 and 45, a standard deviation of about 22, and a minimum and maximum of 11 and 89 in both cases. Does parental earning affect the performance of students in school?
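Summary statistics like these can be produced with `agg`. The sketch below runs on made-up values, and the column names `mother_earning_pct` and `father_earning_pct` are assumptions, so it will not reproduce the figures quoted above.

```python
import pandas as pd

# Toy earnings percentiles for illustration only.
df = pd.DataFrame({
    "mother_earning_pct": [11, 30, 44, 60, 89],
    "father_earning_pct": [11, 35, 45, 55, 89],
})

# One row per statistic, one column per parent.
summary = df.agg(["mean", "std", "min", "max"])
print(summary.round(1))
```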

Math attributes

Are students interested in mathematics?

According to the survey data, many students agree that they are interested in mathematics while some disagree, and the same holds for anxiety towards mathematics. Is it because of the teacher's behaviour or the teacher's support? Or does primary education contribute to their interest in or anxiety about mathematics?

The distribution of the teacher's behaviour column appears unimodal (mostly between -1 and 2), while the teacher's support rating is skewed to the left. There are observable extreme values in this distribution that may warrant further investigation (negative values and some values towards 3).

Distribution of Student Performance in Different Subjects

Examine the distribution of student scores in different subjects. Are there any obvious differences in the distribution?

Student scores are approximately normally distributed, with means between 470 and 480 and standard deviations between 101 and 102. What contributes to the success of the students with the highest scores?

Discuss the distribution(s) of your variable(s) of interest. Were there any unusual points? Did you need to perform any transformations?

Of the features you investigated, were there any unusual distributions? Did you perform any operations on the data to tidy, adjust, or change the form of the data? If so, why did you do this?

Bivariate Exploration

This section investigates pairs of variables in the data to identify the relationships between them.

Countries with highest student score

Which countries have student mathematics scores that consistently fall between the overall 25th and 75th percentiles?

The math scores of students in Belgium, the Czech Republic, Germany, and Switzerland consistently fall between 460 and 550. What contributed to the performance of these students? Do they possess the internet, a room, a computer, and textbooks?
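One way to approach this question is to compute the 25th and 75th percentiles of math scores per country and keep the countries whose middle half sits inside a target band. The sketch below is an assumption about the method, not the notebook's original code; the countries "A" and "B" and their scores are made up, and the 460 to 550 band is taken from the text above.

```python
import pandas as pd

# Toy scores for two hypothetical countries.
df = pd.DataFrame({
    "country": ["A"] * 5 + ["B"] * 5,
    "math_score": [460, 480, 500, 520, 540, 300, 400, 500, 600, 700],
})

# 25th and 75th percentile of math scores within each country.
iqr = df.groupby("country")["math_score"].quantile([0.25, 0.75]).unstack()
iqr.columns = ["q25", "q75"]

# Countries whose middle 50% of scores lies inside the 460-550 band.
within_band = iqr[(iqr["q25"] >= 460) & (iqr["q75"] <= 550)]
print(within_band)
```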

Possession effect on student academic performance

How well does possession contribute to the performance of students in their academics?

Most students who score higher in mathematics possess a study room, a computer, the internet, and a textbook. It is possible that students without computers, internet access, and textbooks borrow from friends who have them, since they also score reasonably well.

Still, students with access to all these basic amenities perform excellently in reading examinations.

Students with all the basic amenities also perform better in science examinations than those without.

Investigation of Students in Belgium

Did most students with high performance in exams have access to all basic amenities such as rooms, study places, computers, and the internet?

Not all students with high performance in Belgium possess computers, study places, and the internet, but those with the highest scores have access.

Performance in the international examination does not appear to depend solely on whether students have access to these possessions, because it tests their general knowledge. It is reasonable to suppose that students who perform poorly in mathematics have other areas of strength. Are students with high performance male or female?

Primary Education on Academic performance

What is the effect of primary education on academic performance?

Student scores increase markedly with the amount of primary education attended. Primary education contributes strongly to the performance of students in mathematics.

As can be seen from the above visuals, primary education has a positive influence on student examination scores, with a clear increase from students who did not attend to those who attended for more than a year.

Early Education Influence

Does early education influence student performance in education?

The age at which students start their primary education has a strong influence on their academic performance. As can be seen from the visuals, students who start primary education in the 4–7 age bracket perform better academically than those who start at an older age.

Parent Education Influence on the Performance of the Students

How well did students that had parents with a degree perform compared to those with none? Does parent education have an influence on the performance of students? Do students from higher-education families outperform those from lower-education families?

Students with highly educated parents outperform most students whose parents are less educated.

Parent Employment Status influence on Student Performance

How do students from low-income families perform on their exams? Does a student's parent status influence how well they perform on an exam?

Students with a parent looking for a job tend to perform worse than those with a parent working full-time. It is safe to say that parent occupation is related to student performance.

Parent Earning effect on Student Education Performance

How well did students with high-income parents perform compared to those with low-income parents? Are there any low-income parents whose children perform better in examinations?

There is a slight upward trend between parent earnings and student performance. But this deduction does not always hold, because some students with low-earning parents still achieve high examination scores. As can be seen from the visual, some students with low-earning parents perform quite well!
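A trend of this kind can also be checked numerically by binning earnings and comparing the mean score per band. The sketch below is only an illustration: the data are made up, and the column names, band edges, and band labels are assumptions.

```python
import pandas as pd

# Toy data; one low-earning household has a high score on purpose.
df = pd.DataFrame({
    "parent_earning_pct": [12, 25, 40, 55, 70, 88],
    "math_score": [430, 520, 470, 490, 500, 510],
})

# Split earnings into three hypothetical bands.
df["earning_band"] = pd.cut(
    df["parent_earning_pct"],
    bins=[0, 33, 66, 100],
    labels=["low", "middle", "high"],
)

# Mean math score per band shows the overall direction of the trend.
band_means = df.groupby("earning_band", observed=True)["math_score"].mean()
print(band_means)

# Individual students can still beat the average of a higher band.
best_low = df.loc[df["earning_band"] == "low", "math_score"].max()
print(best_low > band_means.loc["high"])
```

Band means summarize the direction of the relationship, while the last check shows why individual exceptions do not contradict it.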

Relationship between teacher's behaviour and student examination score

Interestingly, the examination scores of students for all rated teachers are quite similar.

Influence of student behaviour on the examination score

It is quite surprising that the behaviour rating of students with the highest scores in mathematics always falls between 0 and 1.

Gender role on Academic Performance

How does gender play a role in student examination scores? Did male students perform better in reading? Are female students good at science?

Male students seem to have higher scores in mathematics and science than females, while females perform extremely well in reading compared to males.

Subject Anxiety on Gender

Which gender has more anxiety when it comes to math, male or female?

As can be seen, a large number of female students agree that they are anxious about mathematics, whereas many male students disagree. This attitude towards mathematics may also contribute to their examination scores, in which male students perform better than female students.

Talk about some of the relationships you observed in this part of the investigation. How did the feature(s) of interest vary with other features in the dataset?

Did you observe any interesting relationships between the other features (not the main feature(s) of interest)?

Multivariate Exploration

This section examines associations among three or more variables to uncover further relationships.

Gender, Score and Country

In most countries, male students score higher than female students in mathematics.

Primary Education and Possession of study place effect on Student Performance

The effect of primary education on math scores varies based on whether students possess a study place. Most students with the best performance in mathematics have a place of study.

Student age in Primary Education effect on their reading performance

Reading scores tend to be consistent among students who were between 4 and 8 years old during primary education, and within that group, students whose parents have a college degree or more perform well in the reading exam.

Student Interest effect on their Performance in Examination

How does the interest of students affect their examination scores? Is it gender specific?

It is clear that the stronger students' interest in mathematics, the higher their exam scores. This pattern holds for both male and female students, i.e., it is not gender specific.
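A pattern like this can be tabulated by grouping on interest level and gender together. The sketch below uses made-up scores, and the column names and interest labels are assumptions, since the notebook's actual category labels are not shown.

```python
import pandas as pd

# Toy data for illustration; labels and scores are hypothetical.
df = pd.DataFrame({
    "gender": ["Female"] * 3 + ["Male"] * 3,
    "math_interest": ["Disagree", "Agree", "Strongly agree"] * 2,
    "math_score": [450, 480, 510, 460, 495, 520],
})
interest_order = ["Disagree", "Agree", "Strongly agree"]

# Rows: interest level (weakest first); columns: gender; values: mean score.
mean_scores = (
    df.groupby(["math_interest", "gender"])["math_score"]
    .mean()
    .unstack("gender")
    .reindex(interest_order)
)
print(mean_scores)

# True when the mean score rises with interest for every gender.
rises_for_both = all(
    mean_scores[gender].is_monotonic_increasing for gender in mean_scores.columns
)
print(rises_for_both)
```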

Relationships between all numerical features

Is there a relationship between students' examination scores in different subjects? Does the teacher's support have any relationship with the student's examination score?

It can be clearly seen that student examination scores in the different subjects are strongly correlated, with a correlation of about 0.9. Teacher support shows no clear relationship with students' examination scores. Parents' earnings are positively related to students' examination scores.
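Pairwise relationships between numerical features are usually summarized with a Pearson correlation matrix. The sketch below shows the computation on made-up values; the column names are assumptions and the result is not the notebook's actual output.

```python
import pandas as pd

# Toy numerical features for illustration only.
df = pd.DataFrame({
    "math_score": [400, 450, 500, 550, 600],
    "reading_score": [410, 440, 510, 540, 610],
    "teacher_support": [0.5, -0.2, 0.1, 0.4, -0.3],
})

# Pearson correlation between every pair of numerical columns.
corr = df.corr(method="pearson")
print(corr.round(2))
```

Values near 1 or -1 indicate a strong linear relationship, while values near 0 indicate little or no linear relationship.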

Parent Earning Effect on Students score

There is an upward trend between parent earnings and the performance of students in examinations. The upward trend is smoother for students who possess a study place than for those who do not, i.e., their scores increase more consistently with their parents' earnings.

Talk about some of the relationships you observed in this part of the investigation. Were there features that strengthened each other in terms of looking at your feature(s) of interest?

Were there any interesting or surprising interactions between features?

Conclusions

When a student's basic school foundation is strong, their performance tends to be strong as well. This study supports the view that children perform better academically the sooner they begin their schooling. Because most resources that contribute to learning cost money, parents' education and occupation have a significant impact on their children's academic and vocational performance. As is widely known, literate parents approach schooling very differently from uneducated parents. Most students who have a study room, books to read, a computer for research, and internet access do well in school and achieve higher exam scores, but the picture is different for general knowledge (the international examination), because every student has different areas of strength and weakness.

Results demonstrate that a student's performance is significantly influenced by how interested they are in a subject. Students do better as they become more interested in a subject. Overall, parents' involvement in their children's education has a significant impact on how well they succeed.