Abstract
Prediction of student academic performance is still a problem because of the limitations of the existing methods specifically low generalizability and lack of interpretability. This study suggests a new approach that deals with the current problems and provides more reliable predictions. The proposed approach combines the information gain (IG) and Laplacian score (LS) for feature selection. In this feature selection scheme, combination of IG and LS is used for ranking features and then, Sequential Forward Selection mechanism is used for determining the most relevant indicators. Also, combination of random forest algorithm with a genetic algorithm for is introduced for multi-class classification. This approach strives to attain more accuracy and reliability than current techniques. The case study shows the proposed strategy can predict performance of students with average accuracy of 93.11 % which shows a minimum improvement of 2.25 % compared to the baseline methods. The findings were further confirmed by the analysis of different evaluation metrics (Accuracy, Precision, Recall, F-Measure) to prove the efficiency of the proposed mechanism.