{ "cells": [ { "attachments": {}, "cell_type": "markdown", "metadata": {}, "source": [ "# Comparing PPIs\n", "\n", "The PPIRef package provides wrappers for [iAlign](https://doi.org/10.1093/bioinformatics/btq404) and [US-align](https://www.biorxiv.org/content/10.1101/2022.04.18.488565v1), as well as their scalable approximation [iDist](https://arxiv.org/pdf/2310.18515.pdf) (used to construct the PPIRef dataset) for comparing PPI structures. Additionally it provides a sequence identity comparator to compare PPIs by their sequences.\n", "\n", "> 📌 Using wrappers for iAlign and US-align requires their installation. Please refer to the Reference API documentation for details." ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [], "source": [ "from ppiref.comparison import IAlign, USalign, IDist, SequenceIdentityComparator, FoldseekMMComparator\n", "from ppiref.extraction import PPIExtractor\n", "from ppiref.definitions import PPIREF_TEST_DATA_DIR\n", "\n", "# Suppress BioPython warnings\n", "import warnings\n", "from Bio import BiopythonWarning\n", "warnings.simplefilter('ignore', BiopythonWarning)\n", "\n", "# Suppress Graphein log\n", "from loguru import logger\n", "logger.disable('graphein')" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": {}, "source": [ "Prepare near-duplicate PPIs from Figure 1 in the [\"Learning to design protein-protein interactions with enhanced generalization\"](https://arxiv.org/pdf/2310.18515.pdf) paper.\n", "\n", "
\n",
"
\n",
"
| \n", " | PPI0 | \n", "PPI1 | \n", "iDist | \n", "
|---|---|---|---|
| 0 | \n", "1p7z_A_C | \n", "1p7z_A_C | \n", "0.000000 | \n", "
| 1 | \n", "1p7z_A_C | \n", "3p9r_B_D | \n", "0.003466 | \n", "
| 2 | \n", "3p9r_B_D | \n", "1p7z_A_C | \n", "0.003466 | \n", "
| 3 | \n", "3p9r_B_D | \n", "3p9r_B_D | \n", "0.000000 | \n", "