Abstract
The accurate prediction of human movement direction plays a crucial role in fields such as rehabilitation monitoring, sports science, and intelligent military systems. Based on plantar pressure and inertial sensor data, this study developed a hybrid deep learning model integrating a Convolutional Neural Network (CNN) and a Bidirectional Long Short-Term Memory (BiLSTM) network to enable joint spatiotemporal feature learning. Systematic comparative experiments involving four distinct deep learning models-CNN, BiLSTM, CNN-LSTM, and CNN-BiLSTM-were conducted to evaluate their convergence performance and prediction accuracy comprehensively. Results show that the CNN-BiLSTM model outperforms the other three models, achieving the lowest RMSE (0.26) and MAE (0.14) on the test set, with an R(2) of 0.86, which indicates superior fitting accuracy and generalization ability. The superior performance of the CNN-BiLSTM model is attributed to its ability to effectively capture local spatial features via CNN and model bidirectional temporal dependencies via BiLSTM, thus demonstrating strong adaptability for complex motion scenarios. This work focuses on the optimization and comparison of deep learning algorithms for spatiotemporal feature extraction, providing a reliable framework for real-time human motion prediction and offering potential applications in intelligent gait analysis, wearable monitoring, and adaptive human-machine interaction.