Abstract
In the field of sustainable mobility, this study highlights the importance of using machine learning for predictive modeling based on real traffic data collected from instrumented bicycles. The advent of advanced technologies like sustainable mobility apps, sensors, and advanced data analysis methods led to the ability to collect data from various sources, which enabled researchers to estimate battery state of charge (SOC) accurately. Most current research uses them in the lab experiments for data collection. In this work, we use real-time sensors data to construct data-driven models for lithium-ion battery SOC estimation. This research integrates both electric bicycle battery, environmental and route variables to achieve the following goals: (1) Collect a multimodal data set including operational, topography, vehicle, and external variables, (2) Preprocess data obtained from sensors installed on the electric bicycle battery, (3) Create models of lithium-ion battery SOC based on electric bicycle battery and environmental variables, and (4) Assess data-driven models and compare their performance for lithium-ion battery SOC with high accuracy. To achieve that, we conducted a real study to predict the Remaining Useful Life (RUL), as a measure of state of charge, of electric bicycle battery. The study was carried out on a 15 km cycle route in Medellín, Colombia, for 28 days. To estimate the RUL, we used four different machine learning algorithms: Long Short-Term Memory (LSTM), Support Vector Regression (SVR), AdaBoost, and Gradient Boost. Notably, data preprocessing techniques played a pivotal role, with a particular focus on smoothing sensor data using Convolutional Neural Networks (CNN). The results showed a significant improvement in prediction accuracy when using data preprocessing, confirming its importance in improving model performance. Furthermore, the comparison of network performance facilitated the selection of the most effective model for the test data. This study underscores the value of using real-world data to develop and validate predictive models in the pursuit of sustainable mobility solutions, and highlights the critical role of data-driven methodologies in addressing today's urban transportation challenges.