A hybrid deep learning and fuzzy logic framework for feature-based evaluation of english Language learners

一种基于特征的英语学习者评估的混合深度学习和模糊逻辑框架

阅读:1

Abstract

The integration of artificial intelligence (AI) and natural language processing (NLP) into language learning and assessment has unlocked new possibilities for accurately profiling English language learners (ELLs) and personalizing educational interventions. While previous studies have typically focused on isolated techniques either deep learning, traditional machine learning, or linguistic rule-based models there remains a critical need for comprehensive frameworks that combine the interpretability of rule-based reasoning with the predictive power of advanced AI. Addressing this gap, the present study introduces a novel hybrid methodology for ELL evaluation, integrating both rule mining through fuzzy logic and a state-of-the-art fusion model that integrates DeBERTa, metadata features, and LSTM architectures. This approach employs a hybrid DeBERTa + Metadata + LSTM (DBML) model, where DeBERTa serves as a transformer backbone to extract rich textual embeddings via attention mechanisms, Metadata features capture contextual, cognitive, and demographic learner traits, and LSTM layers are utilized for effective temporal modeling and dense integration. This comprehensive pipeline allows for complex prediction of language proficiency levels, dealing with both unstructured (text response) and structured (behavioral and demographic) data streams. Empirical comparisons against standard machine learning, deep learning, and standalone transformer models demonstrate the superiority of the proposed hybrid approach, achieving a peak accuracy of 93% significantly higher than benchmarked baselines. Furthermore, the study extensively investigates model reliability using statistical significance tests and eXplainable AI (XAI) techniques such as SHAP and DeepSHAP. These analyses not only confirm the model's robustness but also reveal the centrality of linguistic attributes (e.g., Syntax, Cohesion, Vocabulary) in classification, as further substantiated by comprehensive feature ranking including Information Gain, Gain Ratio, Gini Index, and permutation importance based on random forest algorithm for fuzzy rule extraction for top features.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。