Gyori Lab for Computational Biomedicine

This website describes projects that are led by Benjamin M. Gyori at Northeastern University since August 2023. Previously, the research group was based at the Laboratory of Systems Pharmacology, Harvard Program in Therapeutic Science (HiTS), at Harvard Medical School.

Introduction

Our group is developing systems that can accelerate scientific discovery in biomedicine using a combination of text mining, knowledge assembly, mathematical modeling and causal analysis. We are pursuing approaches that use artificial intelligence to increasingly automate the interpretation of scientific literature and large experimental datasets, and also to enable sophisticated human-machine interaction and collaboration. Applications of these tools range from drug discovery for cancer and other diseases to modeling complex processes at the interface of physical and social systems.

Applications

INDRA

INDRA (Integrated Network and Dynamical Reasoning Assembler) is an automated model assembly system that uses natural language processing and structured databases to collect mechanistic and causal assertions, represent them in a standardized form, and assemble them into causal graphs or dynamical models. Internally, INDRA performs knowledge assembly to correct errors, find and resolve redundancies, infer missing information, filter to a scope of interest and assess belief.
Code Docs REST API

INDRA Biomedical Discovery Engine

The INDRA Biomedical Discovery Engine is an integrated web portal that builds around an automatically assemble knowledge graph combining causal mechanisms assembled by INDRA with relations repesenting ontologies and data. It supports uploading user data to run analysis as well as the browsing and curation of different facets of biomedical knowledge. The website also contains an integrated human-machine dialogue system to sequentially explore the knowledge graph.
Website Code

INDRA Database

The INDRA Database makes knowledge assembled by INDRA at scale available as a service. It aggregates knowledge extracted by multiple machine-reading systems from all available abstracts and open-access full text articles, and combines this with mechanisms from pathway databases. Queries allow searching for genes, chemicals, biological processes and other concepts of interest, and returns a ranked list of relevant interactions.
Website REST API Code

MIRA

MIRA is a meta-modeling framework that enables rapid prototyping of models across scientific domains including epidemiology, ecology, and systems biology. It uses ontology-grounded templates to represent model structure, allowing the same model to be exported to multiple simulation formats and enabling automated model comparison and assembly.
Code Docs

EMMAA

EMMAA (Ecosystem of Machine-maintained Models with Automated Analysis) monitors the scientific literature for new findings and automatically updates a set of disease-specific models with new knowledge. It also automatically analyzes these models against a set of test conditions (typically experimental observations) and measures the effect of new knowledge on these results. It then notifies users about relevant new analysis results.
Website Code Docs REST API

INDRA-IPM (Interactive Pathway Map)

The INDRA-IPM allows you to build pathway maps using natural language descriptions. You simply describe the set of mechanisms to include in English, and then click a button to assemble and lay out a pathway map. The pathway can be exported into various formats like SBML, SBGN, Kappa and others.
Website Code

CLARE machine assistant

CLARE is a machine assistant that can be deployed as a Slack application in a workspace and engage in human-machine dialogue in channels and private messages. It can answer questions about mechanisms such as "what phosphorylates ELK1?" or "does RHOA interact with MYL12B?" and connect this information to other resources, while also allowing to build and discuss models of mechanisms. Please contact us if you'd like to deploy CLARE on your Slack workspace.
Demo video

INDRA World

In the context of the DARPA World Modelers program, we have generalized INDRA to modeling complex causal mechanisms governing processes such as agricultural production, food security and migration. We were also part of the DSMT-E project, a large collaboration funded by DARPA and the Gates Foundation to develop AI-driven decision support tools for Ethiopia.
Code World Modelers technical website

Network search and DepMap explainer

The INDRA Network search builds on the INDRA assembly of literature extractions and pathway databases to find mechanistic paths between entities of interest. It allows searching for a variety of patterns (paths between, common up/downstreams, etc.) and tuning multiple search parameters to define constraints and context. The DepMap explainer is a specific instance of network search aimed at constructing explanations for correlations between genes involved in CRISPR screens of cancer cell lines found at depmap.org.
DepMap explainer Network search

Application to studying pain and inflammation

In the context of the DARPA Panacea program, the team has worked on understanding the regulation of pain and inflammation with the goal of finding new therapeutics using INDRA.
Website

Gilda

Gilda is a Python package and REST service that grounds (i.e., finds appropriate identifiers in namespaces for) named entities in biomedical text. It also uses a set of machine-learned disambiguation models to choose between different senses of ambiguous synonyms. It can be integrated into applications as a Python package or through the REST service.
Website and REST API Code

Adeft

Adeft (Acromine based Disambiguation of Entities From Text context) builds machine-learning models to disambiguate acronyms and other abbreviations of biological terms in the scientific literature. A growing number of pretrained disambiguation models are publicly available through the Python package.
Code

Biopragmatics stack

We developed the Biopragmatics stack, a collection of interlinked software packages that provide infrastructure for working with biomedical ontologies and their entries. They include the Bioregistry (a meta-registry of biomedical nomenclatures), Bioversions (a service for tracking versions of biomedical resources), Biomappings (community curated equivalences across biomedical ontologies), Biolookup (a service for retrieving metadata about biomedical entities), and PyOBO (a software package to facilitate processing biomedical ontologies in a unified fashion).
Website Code

Projects and Funding

Current

Automating Scientific Knowledge Extraction and Modeling (ASKEM) The DARPA ASKEM program aims to develop technologies to significantly speed up how experts create, maintain and analyze models of complex processes such as viral pandemics and space weather. We are developing the MIRA system which is a framework for representing systems using ontology-grounded meta-model templates, and generating various model implementations and exchange formats from these templates. It also implements algorithms for assembling and querying domain knowledge graphs in support of modeling.
Funded by the Defense Advanced Research Projects Agency under award HR00112220036. (2022-)

Biomedical Data Frabric (BDF) The ARPA-H BDF program develops technologies for large-scale automated data integration to make it possible to extract more value out of data. Our lab is developing new approaches to semantic integration, automated data annotation, knowledge extraction and assembly, automated hypothesis generation, and human-machine dialogue.
Funded by ARPA-H via DARPA award HR00112220036. (2023-)

Rapid Assessment of Platform Technologies to Expedite Response (RAPTER) The RAPTER program aims to create a machine-learning tool to revolutionize the future of vaccine development, rapidly choosing a suitable vaccine platform for any viral and bacterial pathogen. Within RAPTER, we are developing text mining, knowledge assembly and knowledge graph technology to create an actionable repository of vaccine/pathogen knowledge from which models can be derived.
Funded by the Defense Threat Reduction Agency. (2023-)

Bioregistry The Bioregistry is a meta-registry that catalogues and harmonizes the landscape of biomedical nomenclature resources, providing standardized access to identifiers across databases and ontologies. It enables seamless data integration by resolving inconsistencies between overlapping resources and offering unified APIs and tools for working with biomedical identifiers.
Funded by the Chan Zuckerberg Initiative (CZI) under award 2023-329850. (2023-)

Cause of Death Determination Assistant (CODA) We are developing a software platform for AI-assisted cause of death determination in low-resource settings. The system processes unstructured dialogue from verbal autopsy interviews, grounds key terms in clinical terminologies using a knowledge graph, and interfaces with cause of death determination algorithms to provide recommendations for follow-up questions and diagnostic tests.
Funded by the Gates Foundation under award INV-094249. (2025-)

Past

Big Mechanism The DARPA Big Mechanism program set out to automate the reading, assembly and modeling of mechanisms from the scientific literature. We built INDRA, an automated model assembly system which draws on natural language processing systems, and assembles their output into various predictive and explanatory models.
Funded by the Defense Advanced Research Projects Agency under award W911NF-14-1-0397 (2014-2019).

Communicating with Computers The DARPA Communicating with Computers (CwC) program develops technologies for a new generation of human-machine interaction in which machines act as proactive collaborators rather than merely problem-solving tools. We are developing an interactive dialogue system which allows scientists to interact with a computer partner – one that is able to harness knowledge extracted from the biomedical literature – to construct and test hypotheses about molecular systems.
Funded by the Defense Advanced Research Projects Agency under award W911NF-15-1-0544 (2015-2021).

Automated Scientific Discovery Framework The DARPA Automated Scientific Discovery Framework program (ASDF) will develop algorithms and software for reasoning about complex mechanisms operating in the natural world, explaining large-scale data, assisting humans in generating actionable, model-based hypotheses and testing these hypotheses empirically.
Funded by the Defense Advanced Research Projects Agency under award W911NF018-1-0124. (2018-2020)

World Modelers The DARPA World Modelers program aims to develop automated information collection and computational modeling techniques to understand the complex dynamics of global processes such as food security, migration and public health. We are developing the INDRA-GEM (Integrated Network and Dynamical Reasoning Assembler for Generalized Ensemble Modeling) automated model assembly system, which integrates information from diverse sources and implements novel probabilistic assembly techniques that can account for the uncertain nature of information in models.
Funded by the Defense Advanced Research Projects Agency under award W911NF-18-1-0014. (2017-2022)

Automating Scientific Knowledge Extraction The DARPA ASKE program is part of DARPA's broader Artificial Intelligence Exploration program with the goal of developing technologies for the "Third Wave" of AI. We are developing EMMAA (Ecosystem of Machine-maintained Models with Automated Assembly), a set of self-updating models of cancer biology that run analysis proactively, and report about meaningful changes in conclusions to users.
Funded by the Defense Advanced Research Projects Agency under award HR00111990009. (2018-2021)

Panacea The STOP PAIN project, as part of DARPA’s Panacea program, aims to develop novel drugs for the treatment of pain and inflammation using innovative research platforms. Unlike many modern drug discovery campaigns, which are target focused, we combine target-agnostic screening with network inference tools to create causal and mechanistic networks used for identification of previously unknown target-chemical ligand relationships.
Funded by the Defense Advanced Research Projects Agency under award HR00111920022. (2019-2023)

DARPA Young Faculty Award and Director's Fellowship Award Benjamin M. Gyori received a DARPA Young Faculty Award for 2020-2022 for the project "Collaborative scientific discovery with semantically linked, machine built models". The project aims to automate the construction of models of complex mechanisms embedded in knowledge graphs, and build automated analysis and human-machine collaboration capabilities around this to enable rapid scientific discovery. Due to the initial success of the project, Ben was awarded the DARPA Director's Fellowship Award for 2022-2023 to continue the project.
Funded by the Defense Advanced Research Projects Agency under award W911NF2010255. (2020-2023)

Generative AI models for pandemic preparedness and response planning In a project funded by CAPTRS, we are developing generative and knowledge-graph-based approaches to creating pandemic scenarios to help preparedness and response.
Funded by CAPTRS (2024).

People

Benjamin M. Gyori, PhD

Associate professor GitHub Google Scholar ORCID

Klas Karis

Scientific software developer GitHub Google Scholar ORCID

Jici Jiang

PhD student GitHub Google Scholar ORCID

Zheng Liu

PhD student Google Scholar ORCID

Buz Galbraith

PhD student GitHub

Sameer Rawat

PhD student GitHub

Nihar Sanda

Research scientist GitHub Google Scholar ORCID

Madison Blain

Program manager

Rushali Mohbe

Research scientist GitHub Google Scholar

Tenzin Nanglo

Scientific software developer GitHub

Haohang Yan

Scientific software developer GitHub

Karen Sachs, PhD

Senior research scientist GitHub Google Scholar ORCID

Prasham Marfatia

Scientific software developer GitHub

Albert Steppi, PhD

Scientific software developer GitHub Google Scholar ORCID

Eric Mockler

Research scientist GitHub ORCID

Thomas Lim

Scientific software developer GitHub

Oscar Ji

Co-op student GitHub ORCID

Past members at Northeastern (2023-)

Sepideh Fouladzadeh - research assistant (2025)
Charles Tapley Hoyt, PhD - postdoc, senior scientist (2021-2024)
Mufaddal Naguthanawala - master's student (2024-2025)
Kush Thakkar - master's student (2025)
Mike Anselmi - scientific software developer (2025)
Fruzsina Kovacs - summer intern (2025)
Pruthvi Navada - master's student (2024)
Sangeetha Vempati - research assistant (2023-2024)
Tomo Oga - undergraduate student (2024-2025)
Aria Agarwal - undergraduate student (2024)
Owen Sharpe - undergraduate student (2025)
Nalika Palayoor - undergraduate student (2025)
Vatsal Mehta - undergraduate student (2023-2024)
Curie Cha - undergraduate student (2025)
Maaya Manoj - summer intern (2025)
Galileo Steinberg - summer intern (2024)
Tanay Shah - summer intern (2024)
Bihan Dasgupta - summer intern (2023)

Past members and colleagues at Harvard (2015-2023)

John Bachman, PhD - co-team lead (2015-2021)
Samuel Bunga - bioinformatics software developer (2020-2022)
Diana Kolusheva - scientific software developer (2018-2022)
Albert Steppi, PhD - scientific software developer (2018-2021)
Patrick Greene - scientific software developer (2017-2021)
Petar Todorov - scientific software developer (2016-2018)
Isabel Latorre, PhD - program manager (2016-2018)
P.S. Thiagarajan, PhD - visiting professor (2015-2018)
Daniel Milstein - scientific software developer (2018)
William Chen, PhD - project lead (2014-2015)
Andras Stirling - summer intern (2021)
Askar Kolushev - summer intern (2020)
Aditya Parmar - summer intern (2016)

Open Positions

We are looking for new lab members in multiple different positions:

PhD Candidate: Applications have to go through the university application portals: Computer Science PhD program at https://www.khoury.northeastern.edu/apply/phd-apply/ or the Bioengineering PhD program at https://bioe.northeastern.edu/academics/graduate-studies/phd-bion/. However, the lab is not anticipating recruiting new PhD students currently.
Postdoctoral Fellow: For research-oriented postdoctoral roles, please reach out directly.
Biocurator Positions: We are looking for a part-time or volunteer biocurator for the Bioregistry project. See details here.

Publications

Note that these publications are ones specific to the team's projects. Gyori's full publication list is available on Google Scholar.

2025

Published Charles Tapley Hoyt, Klas Klaris, Benjamin M. Gyori Assembly and reasoning over semantic mappings at scale for biomedical data integration Bioinformatics, 2025

Published Lindsey N Anderson, Charles Tapley Hoyt, Jeremy Zucker, Jeremy R. Teuton, Andrew D. McNaughton, Klas Klaris, Natasha N. Arokium-Christian, Jackson T. Warley, Zachary R. Stromberg, Benjamin M. Gyori, and Neeraj Kumar Computational Tools and Data Integration to Accelerate Vaccine Development: Challenges, Opportunities, and Future Directions Frontiers in Immunology, 2025

Published Vasilevsky NA, Toro S, Matentzoglu NA, ..., Benjamin M. Gyori ... Charles Tapley Hoyt, ..., Mungall CJ, Haendel MA. Mondo: Integrating Disease Terminology Across Communities. Genetics, 2025.

Published Jici Jiang, Predrag Radivojac, Benjamin M. Gyori Literature-driven extraction and computational prediction of causal statements linking genetic variants to biological processes, pathways and phenotypes PSB, 2026

Published Lo Surdo P, Iannuccelli M, Klas Karis, ... Benjamin M. Gyori, Livia Perfetto. SIGNOR 4.0: the 2025 update with focus on phosphorylation data Nucleic Acid Research, 2025

Accepted Tong Wang, Benjamin M. Gyori, Scott T. Weiss, Giulia Menichetti, Yang-Yu Liu. Revealing Interactions between Microbes, Metabolites, and Dietary Compounds using Genome-scale Analysis. Microbiome 2025

Preprint Veselina Petrova, Caitlin E. Mills, ... Sepideh Fouladzadeh, ... Marian Kalocsay, Benjamin M. Gyori, Ahmet Hoke, Peter K. Sorger, Clifford J. Woolf. Multi-Kinase Inhibition of STE20 Kinases is Neuroprotective Against Chemotherapy-Induced Axon Degeneration, SSRN, 2025

Preprint Charles Tapley Hoyt, Craig Bakker, Richard J. Callahan, Joseph Cottam, August George, Benjamin M. Gyori, Haley M. Hummel, Nathaniel Merrill, Sara Mohammad Taheri, Pruthvi Prakash Navada, Marc-Antoine Parent, Adam Rupe, Olga Vitek, Jeremy Zucker Causal identification with Y0, arXiv, 2025

Preprint Shawn ZK Tan, ..., Benjamin M. Gyori, Melissa A Haendel, Charles Tapley Hoyt ... David Osumi-Sutherland The Cell Ontology in the age of single-cell omics bioRxiv, 2025

Preprint Vartika Tewari, Devon Kohler, Sunghyun Huh, Karen Sachs, Benjamin M. Gyori, Olga Vitek. Simulated mass spectrometry-based protein abundances enable effective experimental design, 2025.

2024

Published Sara Mohammad-Taheri, Pruthvi Prakash Navada, Charles Tapley Hoyt, Jeremy Zucker, Karen Sachs, Benjamin M. Gyori and Olga Vitek. Eliater: a Python package for estimating outcomes of perturbations in biomolecular networks. Bioinformatics 2024.

Published Benjamin M. Gyori and Olga Vitek Beyond protein lists: AI-assisted interpretation of proteomic investigations in the context of evolving scientific knowledge Nature Methods 2024.

Published Charles Tapley Hoyt and Benjamin M. Gyori The O3 guidelines: open data, open code, and open infrastructure for sustainable curated scientific resources Scientific Data 2024.
Summary video: https://youtu.be/kuJsl-rRjZY

Published Jain A, Gyori BM, Hakim S, Jain A, Sun L, Petrova V, ..., Bunga S, ..., PK Sorger, CJ Woolf Nociceptor-immune interactomes reveal insult-specific immune signatures of pain. Nature Immunology, 2024

Published Tiffany J. Callahan, Ignacio J. Tripodi, Adrianne L. Stefanski, Luca Cappelletti, Sanya B. Taneja, Jordan M. Wyrwa, Elena Casiraghi, Nicolas A. Matentzoglu, Justin Reese, Jonathan C. Silverstein, Charles Tapley Hoyt, ..., Lawrence E. Hunter An open source knowledge graph ecosystem for the life sciences. Scientific Data, 2024

2023

Published Garano, M. A., ..., Gyori, B. M., ..., Hoyt, C. T., ..., Robinson, P. N. The Human Phenotype Ontology in 2024: phenotypes around the world Nucleic Acids Research, gkad1005, 2023.

Published Bachman JA, Gyori BM, Sorger PK Automated assembly of molecular mechanisms at scale from text mining and curated databases Molecular Systems Biology, e11325, 2023.

Published Hoyt, C. T., Hoyt, A. L., and Gyori, B. M. Prediction and Curation of Missing Biomedical Identifier Mappings with Biomappings Bioinformatics, 39(4), btad130, 2023.

Published Pillich RT, Chen J, Churas C, Fong D, Gyori BM, Ideker T, Karis K, Liu SN, Ono K, Pico A, Pratt D NDEx IQuery: a multi-method network gene set analysis leveraging the Network Data Exchange Bioinformatics, 39(3), btad118, 2023.

Published Lobentanzer S, Aloy P, Baumbach J, Bohar B, Danhauser K, Doğan T, Dreo J, Dunham I, Fernandez-Torras A, Gyori BM, Hartung M, Hoyt CT, Klein C, Korcsmaros T, Maier A, Mann M, Ochoa D, Pareja-Lorente E, Preusse P, Probul N, Schwikowski B, Sen B, Strauss MT, Turei D, Ulusoy E, Wodke J, Saez-Rodriguez J Democratising Knowledge Representation with BioCypher Nature Biotechnology, 2023.

Published Niarakis A, Ostaszewski M, Mazein A, ..., Gyori BM, ..., Reinhard Schneider Drug-Target identification in COVID-19 disease mechanisms using computational systems biology approaches Frontiers in Immunology, 2023.

2022

Published Hoyt CT, Balk M, Callahan, TJ, Domingo-Fernandez D, Haendel MA, Hegde HB, Himmelstein DS, Karis K, Kunze J, Lubiana T, Matentzoglu N, McMurry J, Moxon S, Mungall CJ, Rutz A, Unni DR, Willighagen E, Winston D, & Gyori BM Unifying the Identification of Biomedical Entities with the Bioregistry Scientific Data, 2022.

Published Gyori BM, Hoyt CT, Steppi A, Gilda: biomedical entity text normalization with machine-learned disambiguation as a service Bioinformatics Advances, 2022.

Published Scholten B, Guerrero Simón L, Krishnan S, Vermeulen R, Pronk A, Gyori BM, Bachman JA, Vlaanderen J, Stierum R. Automated Network Assembly of Mechanistic Literature for Informed Evidence Identification to Support Cancer Risk Assessment Environmental Health Perspectives, 2022.

Published Gyori BM and Hoyt CT. PyBioPAX: biological pathway exchange in Python Journal of Open Source Software, 2022.

Published Balabin H, Hoyt CT, Birkenbihl C, Gyori BM, Bachman JA, Kodamullil AT, Ploeger PG, Hofmann-Apitius M, Domingo-Fernandez D. STonKGs: A Sophisticated Transformer Trained on Biomedical Text and Knowledge Graphs Bioinformatics, 2022.

Published Balabin H, Hoyt CT, Gyori BM, Bachman JA, Kodamullil AT,Hofmann-Apitius M, Domingo-Fernandez D. ProtSTonKGs: A Sophisticated Transformer Trained on Protein Sequences, Text, and Knowledge Graphs SWAT4HCLS, 2022.

Published N Matentzoglu, JP Balhoff, SM Bello, C Bizon, M Brush, TJ Callahan, CG Chute, WD Duncan, CT Evelo, D Gabriel, J Graybeal, A Gray, BM Gyori, M Haendel, H Harmse, NL Harris, I Harrow, H Hegde, AL Hoyt, CT Hoyt, ..., CJ Mungall A Simple Standard for Sharing Ontological Mappings (SSSOM) Database, 2022.

Published Bonner S, Barrett IP, Ye C, Swiers R, Engkvist O, Hoyt CT, Hamilton WL Understanding the Performance of Knowledge Graph Embeddings in Drug Discovery Artificial Intelligence in the Life Sciences, 2022.

Published Doherty LM, Mills CE, Boswell SA, Liu X, Hoyt CT, Gyori BM, Buhrlage SJ, Sorger PK. Integrating multi-omics data reveals function and therapeutic potential of deubiquitinating enzymes eLife 11:e72879, 2022.
Website: DUB portal

Published Mohammad-Taheri S, Zucker J, Hoyt CT, Sachs K, Tewari V, Ness R, Vitek O Do-calculus enables causal reasoning with latent variable models Bioinformatics, 2022.

Published de Crécy-lagard V, de Hegedus RA, Arighi C, ..., Gyori BM, ..., Xu J A roadmap for the functional annotation of protein families: a community perspective Database, 2022.

Published Rozemberczki B, Hoyt CT, Gogleva A, Grabowski P, Karis K, Lamov A, Andriy N, Nilsson S, Ughetto M, Wang Y, Derr T, Gyori BM. ChemicalX: A Deep Learning Library for Drug Pair Scoring KDD, 2022.

Published Hungerford J, Chan YS, MacBride J, Gyori BM, ..., Reynolds M, Surdeanu M, Bethard S, Sharp R Taxonomy Builder: a Data-driven and User-centric Tool for Streamlining Taxonomy Construction NAACL HCI+NLP, 2022.

Published Bonner S, Barrett IP, Ye C, Swiers R, Engkvist O, Bender A, Hoyt CT, Hamilton WL A Review of Biomedical Datasets Relating to Drug Discovery: A Knowledge Graph Perspective Briefings in Bioinformatics, 2022.

Published Matentzoglu N, Goutte-Gattat D, Tan SZK, Balhoff JP, Carbon S, Caron AR, Duncan WD, Flack JE, Haendel M, Harris NL, Hogan WR, Hoyt CT, Jackson RC, Kim H, Kir H, Larralde M, McMurry JA, Overton JA, Peters B, ... Osumi-Sutherland D. Ontology Development Kit: a toolkit for building, maintaining, and standardising biomedical ontologies Database, 2022.

Preprint Mohammad-Taheri S, Tewari V, Kapre R, Rahiminasab E, Sachs K, Hoyt CT, Zucker J, Vitek O Experimental design for causal query estimation in partially observed biomolecular networks arXiv, 2022.

Preprint Bachman JA, Sorger PK, Gyori BM. Assembling a corpus of phosphoproteomic annotations using ProtMapper to normalize site information from databases and text mining bioRxiv, 2022.

Preprint Hoyt CT, Berrendorf M, Gaklin M, Tresp V, Gyori BM. A Unified Framework for Rank-based Evaluation Metrics for Link Prediction in Knowledge Graphs arXiv, 2022.

Preprint Gaklin M, Berrendorf M, Hoyt CT An Open Challenge for Inductive Link Prediction on Knowledge Graphs arXiv, 2022.

2021

Published Gyori BM, Bachman JA. From knowledge to models: automated modeling in systems and synthetic biology. Current Opinion in Systems Biology, 2021.

Published Ietswaart R, Gyori BM, Bachman JA, Sorger PK, Churchman S GeneWalk identifies relevant gene functions for a biological context using network representation learning Genome Biology, 2021 22(51).

Published Gyori BM, Bachman JA, Kolusheva D. A self-updating causal model of COVID-19 mechanisms built from the scientific literature Proceedings of the BioCreative VII Challenge Evaluation Workshop, 2021.

Published Wong J, Franz M, Siper MC, Fong D, Durupinar F, Dallago C, Luna A, Giorgi JM, Rodchenkov I, Babur O, Bachman JA, Gyori BM, Demir E, Bader G, Sander C. Author-sourced capture of pathway knowledge in computable form using Biofactoid eLife, 2021 10:e68292.

Published Ostaszewski M, Niarakis A, ... Gyori BM, Bachman JA, ..., Baling R, Schneider R. COVID-19 Disease Map, a computational knowledge repository of SARS-CoV-2 virus-host interaction mechanisms Molecular Systems Biology, 2021.

Preprint Moret N, Liu C, Gyori BM, Bachman JA, Steppi A, Taujale R, Huang LC, Hug C, Berginski M, Gomez S, Kannan N, Sorger PK. Exploring the understudied human kinome for research and therapeutic opportunities bioRxiv, 2021.

2020

Published Nam KM, Gyori BM, Amethyst SV, Bates DJ, Gunawardena J Robustness and parameter geography in post-translational modification systems PLoS Computational Biology, 2020.

Published Steppi A, Gyori BM, Bachman JA. Adeft: Acromine-based Disambiguation of Entities from Text with applications to the biomedical literature Journal of Open Source Software, 2020.

2017-2019

Published Todorov PV, Gyori BM, Bachman JA, Sorger PK. INDRA-IPM: interactive pathway modeling using natural language with automated assembly. Bioinformatics, 2019.
Website: http://pathwaymap.indra.bio

Published Sharp R, Pyarelal A, Gyori BM, Alcock K, Laparra Egoitz, Valenzuela-Escárcega MA, Nagesh A, Yadav V, Bachman JA, Tang Z, Lent H, Luo F, Paul M, Bethard S, Barnard K, Morrison C, Surdeanu M. Eidos, INDRA, & Delphi: From Free Text to Executable Causal Models NAACL, 2019.

Published Hoyt C, Domingo-Fernández D, Aldisi R, Xu L, Kolpeja K, Spalek S, Wollert E, Bachman J, Gyori BM, Greene P, Hofmann-Apitius M. Re-curation and rational enrichment of knowledge graphs in Biological Expression Language Database, 2019.

Published Bachman JA, Gyori BM, Sorger PK. FamPlex: a resource for entity recognition and relationship resolution of human protein families and complexes in biomedical text mining. BMC Bioinformatics, 2018 19(1):248.
Repository: FamPlex

Published Gyori BM, Bachman JA, Subramanian K, Muhlich JL, Galescu L, Sorger PK. From word models to executable models of signaling networks using automated assembly. Molecular Systems Biology, 2017 13(11):954.
Summary video: https://vimeo.com/265004298

Media

AI can help create new vaccines and drugs quickly highlights our work on the RAPTER project in collaboration with LANL.

Proteins and pathogens: a behind-the-scenes look at a biomedical researcher’s best friend Research spotlight on the Bioregistry project.

"A Step Toward New Solutions for Inflammatory Pain", a Harvard Medical School News article on our paper in Nature Immunology applying INDRA.

Award announcement about our award under the ARPA-H BDF Program

Announcements about the RAPTER project by LANL and by PNNL.

Award announcement : Charlie Hoyt awarded International Society for Biocuration, Excellence in Biocuration Early Career Award.

Award announcement : Ben Gyori awarded DARPA Director's Fellowship for 2022-23.

Award announcement : Grant supporting the MIRA project on the DARPA ASKEM program for 2022-26.

AAAS EurekAlert: Our application of the EMMAA system to neurofibromatosis, "Self-updating causal models to accelerate discovery in NF", won an award from the Children's Tumor Foundation. See also our presentation and report.

WIRED : Our research on human-machine collaboration was featured in WIRED UK, in the article " The merging of humans and machines is happening now", written by then director of DARPA, Arati Prabhakar.

The Guardian : Ben Gyori and John Bachman were interviewed by The Guardian in the tech podcast " Siri of the Cell". Here we introduce our approach to human-machine communication and the assembly of models from the scientific literature.

Harvard Medicine Magazine : Ben Gyori and John Bachman were interviewed for an article in Harvard Medicine Magazine. In "A Closer Read" (see section WALL-E), they talk about natural language processing and the INDRA system.

Harvard Medicine News : Ben Gyori was interviewed by the Harvard News to talk about how AI's Next Wave can be applied to scientific discovery in biology.

DARPA News & Events Our work on the DARPA ASKE program is described in this article.

Harvard Medicine News Our collaboration with the Churchman lab to develop GeneWalk was covered in the article "Panning for Genetic Gold".

Videos

Several talks are available on YouTube via the playlist here.