Abstract
The cluster-in-solvent approach, that is, the use of the quantum-mechanical calculation of a spectral observable on a significant number of solute-solvent clusters extracted from semi-classical simulations, is widely used in computational spectroscopy. However, identifying relevant coordinates for cluster selection remains a challenge. We previously developed the Ellipsoid Method for Cluster-in-Solvent (EMCS), an automated strategy for unbiased identification and statistical weighting of clusters. Yet, for larger solutes, EMCS can yield overly large solvent clusters that hinder conformational analysis. Here, we introduce a simple extension of EMCS that reduces cluster size, enabling its application to medium-to-large solutes. The method is validated through the computation of Vibrational Circular Dichroism (VCD) spectra, which are highly sensitive to solute-solvent interactions. Test cases include aqueous L-alanine, aqueous dialanine, and (1S,2S)-trans-1-amino-2-indanol in DMSO. For L-alanine and trans-1-amino-2-indanol, computed spectra closely match experiment, with root-mean-square-deviation (RMSD) values of 10.3 and 8.0, respectively, consistent with previous benchmarks. For aqueous dialanine, the main spectral features were reproduced, though discrepancies in the fine structure remain, likely due to limitations in capturing subtle solvation effects. Overall, the refined EMCS protocol enables efficient and non-arbitrary sampling of solute-solvent clusters, offering a valuable tool for the structural analysis of solvation shells in complex molecular systems.