ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.13983
36
1

DirectMultiStep: Direct Route Generation for Multistep Retrosynthesis

22 May 2024
Yu Shee
Haote Li
Anton Morgunov
Victor S. Batista
ArXivPDFHTML
Abstract

Traditional computer-aided synthesis planning (CASP) methods rely on iterative single-step predictions, leading to exponential search space growth that limits efficiency and scalability. We introduce a series of transformer-based models, that leverage a mixture of experts approach to directly generate multistep synthetic routes as a single string, conditionally predicting each transformation based on all preceding ones. Our DMS Explorer XL model, which requires only target compounds as input, outperforms state-of-the-art methods on the PaRoutes dataset with 1.9x and 3.1x improvements in Top-1 accuracy on the n1_11​ and n5_55​ test sets, respectively. Providing additional information, such as the desired number of steps and starting materials, enables both a reduction in model size and an increase in accuracy, highlighting the benefits of incorporating more constraints into the prediction process. The top-performing DMS-Flex (Duo) model scores 25-50% higher on Top-1 and Top-10 accuracies for both n1_11​ and n5_55​ sets. Additionally, our models successfully predict routes for FDA-approved drugs not included in the training data, demonstrating strong generalization capabilities. While the limited diversity of the training set may affect performance on less common reaction types, our multistep-first approach presents a promising direction towards fully automated retrosynthetic planning.

View on arXiv
@article{shee2025_2405.13983,
  title={ DirectMultiStep: Direct Route Generation for Multistep Retrosynthesis },
  author={ Yu Shee and Anton Morgunov and Haote Li and Victor S. Batista },
  journal={arXiv preprint arXiv:2405.13983},
  year={ 2025 }
}
Comments on this paper