Skip to content
@GT4SD

Generative Toolkit 4 Scientific Discovery

GT4SD organization

🙋‍♀️ What is GT4SD?

Our mission is to accelerate scientific discovery by creating an open community around generative models for science 🚀

Technologies like generative models need to be an instrument that scientists use to carry out their research quicker and more effectively, rather than something that requires very specific domain knowledge to utilize.

To this end, we created gt4sd-core, an open-source library to accelerate hypothesis generation in the scientific discovery process that eases the adoption of state-of-the-art generative AI. GT4SD includes models that can generate new molecule designs based on properties such as target proteins, target omics profiles, scaffolds distances, binding energies, and additional targets relevant for materials and drug discovery.

The library provides an effective environment for the generation of new hypotheses (inference) and for fine-tuning the models to specific domains using custom data sets (models retraining).

GT4SD's common framework makes models easily accessible to a broader community, like AI/ML practitioners developing new generative models who want to deploy with just a few lines of code. GT4SD provides a centralized environment for scientists and students interested in using generative models in their scientific research, allowing them to access and explore a variety of different models — all of which are pretrained. Consistent commands and interfaces for inference or retraining with customizable parameters harmonize the use across the different models.

The development of problem-specific intelligence is made possible thanks to the automatic workflows enabling retraining with users' own data covering molecular structures and properties. The replacement of manual processes and human bias in the discovery process has important effects on downstream applications that rely on the use of AI models, leading to an acceleration of expert knowledge.

🌈 How can I get involved?

If you want to contribute and get involved check out our guidelines and code of conduct.

👩‍💻 How can I use it?

We are continuously evolving GT4SD but there is already plenty of stuff you can do with it.

A good starting point is to check out the library and the short introductory guide in its README.md.

You can find all implementation details and some examples in our docs.

Try out the notebooks we prepared to use generative models for various scientific applications.

If you are an end-user and not a developer, just try out some of our pretrained models via a simple web-app (no programming required 👩‍💻) gradio-apps

Pinned Loading

  1. gt4sd-core gt4sd-core Public

    GT4SD, an open-source library to accelerate hypothesis generation in the scientific discovery process.

    Jupyter Notebook 340 74

Repositories

Showing 10 of 21 repositories
  • gt4sd-core Public

    GT4SD, an open-source library to accelerate hypothesis generation in the scientific discovery process.

    GT4SD/gt4sd-core’s past year of commit activity
    Jupyter Notebook 340 MIT 74 2 (1 issue needs help) 0 Updated Nov 26, 2024
  • lm-assistant-for-biocatalysis Public

    This project focuses on streamlining and automating key tasks in protein modeling, optimization, and the design of experiments for biocatalyzed reactions

    GT4SD/lm-assistant-for-biocatalysis’s past year of commit activity
    Python 2 MIT 2 0 0 Updated Oct 29, 2024
  • enzeptional Public

    Enzeptional stand-alone python package

    GT4SD/enzeptional’s past year of commit activity
    Python 0 MIT 1 0 0 Updated Oct 24, 2024
  • he-compliant-approximation Public

    Homomorphic encryption compliant learnable approximation

    GT4SD/he-compliant-approximation’s past year of commit activity
    Python 3 MIT 2 0 0 Updated Sep 26, 2024
  • multitask_text_and_chemistry_t5 Public

    Code for "Unifying Molecular and Textual Representations via Multi-task Language Modelling" @ ICML 2023

    GT4SD/multitask_text_and_chemistry_t5’s past year of commit activity
    Jupyter Notebook 35 MIT 1 0 0 Updated Sep 9, 2024
  • GT4SD/gt4sd-trainer-hf-pl’s past year of commit activity
    Python 1 MIT 2 0 0 Updated Sep 3, 2024
  • mtl4ad Public

    Repository to fine-tune various LLMs architectures for AD on a multi-task datasets.

    GT4SD/mtl4ad’s past year of commit activity
    Python 1 MIT 0 0 1 Updated Aug 12, 2024
  • molecular-design Public

    Generative AI pipeline for target-based molecular design

    GT4SD/molecular-design’s past year of commit activity
    Python 2 MIT 1 0 0 Updated Jul 1, 2024
  • reinvent_models Public

    GT4SD fork of Reinvent Models

    GT4SD/reinvent_models’s past year of commit activity
    Python 2 MIT 4 0 0 Updated Jun 20, 2024
  • zero-shot-bert-adapters Public

    Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.

    GT4SD/zero-shot-bert-adapters’s past year of commit activity
    Python 38 MIT 10 3 0 Updated Jun 13, 2023

Top languages

Loading…

Most used topics

Loading…