Diseases related to the central nervous system (CNS) are major health concerns and have serious social and economic impacts. Developing new drugs for CNS-related disorders presents a major challenge as it actively involves delivering drugs into the CNS. Therefore, it is imperative to develop in silico methodologies to reliably identify potential lead compounds that can penetrate the blood–brain barrier (BBB) and help to thoroughly understand the role of different physicochemical properties fundamental to the BBB permeation of molecules. In this study, we have analysed the chemical space of the CNS drugs and compared it to the non-CNS-approved drugs. Additionally, we have collected a feature selection dataset from Muehlbacher et al. (J Comput Aided Mol Des 25(12):1095–1106, 2011. 10.1007/s10822-011-9478-1) and an in-house dataset. This information was utilised to design a molecular fingerprint that was used to train machine learning (ML) models. The best-performing models reported in this study achieved accuracies of 0.997 and 0.98, sensitivities of 1.0 and 0.992, specificities of 0.971 and 0.962, MCCs of 0.984 and 0.958, and ROC-AUCs of 0.997 and 0.999 on an imbalanced and a balanced dataset, respectively. They demonstrated overall good accuracies and sensitivities in the blind validation dataset. The reported models can be applied for fast and early screening drug-like molecules with BBB potential. Furthermore, the bbbPythoN package can be used by the research community to both produce the BBB-specific molecular fingerprints and employ the models mentioned earlier for BBB-permeability prediction.
Scientific Reports.
2024;14(1):8908. doi: 10.1038/s41598-024-59734-9
Keywords
Library Collection(s)