Skip to content

Latest commit

 

History

History
161 lines (115 loc) · 7.07 KB

File metadata and controls

161 lines (115 loc) · 7.07 KB

tech_stack_canvas




20th August'2021

GSoC '21 Report | Aniket Ranjan | @NRNB | Enhancement of Open_Source_Protein_Interaction_Platform

Introduction

The Google summer of code program has been a great and fun learning experience to me over the past months. The project was aimed to enhance the project of openPIP (Open_Source_Protein_Interaction_Platform) to make it production ready.


What is OpenPIP

OpenPIP (Open Source Protein Interaction Platform) is a web application which can be used to visualize, modify, analyze and download thousands of complex protein-protein interactions for further analysis, prediction and research work. The protein interaction data is fed into the platfrom using .psi file format which contains information about the protein interactions and their annotations. The webapplication gets the uniprot/ensemble ID from the .psi file and fetches the protein data like protein_name, gene_name, protein_sequence, description, external_links and saves them into the database. The protein interactions are then visualized using Cytoscape.js library. The application supports search filters which can be used to enrich and filter desired results which can be saved for further analysis.

Protein Interaction data file

data_upload_file



OpenPIP webapp

homepage



Protein_Interaction Visualization using CytoscapeJS

canvas_interaction




Work Summary and Pull requests

BaderLab/openPIP#76

The following features are integrated with this pull request:

Done
  • Inclusion of vendor files in this Symfony project.

  • Fixed double headers in data, files, and announcement section.

  • Fixed navigation bar in data section.

  • Main title made dynamic which can be changed via admin settings.

  • Fixed bug in the search filter.

  • Mission and Method section in the 'home page' made dynamic to be changed via admin settings.

  • Home page heading changed to the dynamic short title.

  • File section: recreated the complete section with the following features.

    • server-side creation and deletion of folder.
    • uploading of any file format in any folder of choice.
    • download, delete and copy path button for each file inside the folder.
    • dropzone section to upload files via drop or click.
    • supports multiple file uploads at once.
    • file upload with progress bar, remove and cancel options.
    • mapping of all uploaded files to data section for protein upload.
    • UI/UX ++
  • Data section: the following features were added.

    • Button to purge the database completely, including proteins, interactions, annotations, external links, etc.
    • Affirmation validation on clean database button.
    • Section for protein upload to the database via .psi format files natively.
    • Choice type form showing all uploaded files on the server for interaction upload selection.
    • Extention of upload to all file formats having tab separated entries.
    • Added validation before file upload, validating upload with file type, double affirmation, and estimated time.
    • Added protein and interaction count in upload validation.
    • Added countdown timer with the estimated time when protein and interactions are being uploaded.
    • Estimated time for the countdown timer made dynamic to each file selected for upload.
    • Added cards showing proteins and interactions present in the database.
    • Section to view gene names uploaded and direct link to visit the search bar with the selected gene name.
    • UI/UX ++
  • Packaged the opepPIP application in docker-compose file:

    • web application is packaged with containerized MySQL8.0 database.
    • application is served using containerized php:7.2.0-apache server.
    • docker-compose file having all the dependencies and interconnections for application.
    • Single command to start the application: docker-compose up.
  • General bug fixes, code cleanup and refactoring.

To Do
  • Documentation, readme and installation guide.
  • Integration of interaction and protein annotation during upload.

Future Scope

  • Optimization of data upload for faster insertion in database. Currently it takes 1.5 seconds for insertion of each protein.
  • Integration of Interaction category, annotations during data upload.
  • Optimizing upload for ensemble and entrez id.
  • Integration of Datasets handle and download.
  • Integration of complex, domain, isoform if when needed.
  • User optimization and registration. Mapping user with proteins uploaded.
  • Upgrading of symfony framework to laset release, since new features can't be installed in unmaintained old version.

Important Links


Special Thanks

I would like to wholeheartedly thank my mentors who were my constant guide and ofcourse without whome this wasn't possible. Thanks for your valuable feedbacks, guidance and project planning. You were amazing!

1. Gary Bader

Professor of Molecular Genetics and Computer Science, The Donnelly Centre, University of Toronto
Profile linkedin LinkedIn  

2. Mohamed Helmy

Senior Specialist, Bioinformatics Institute (BII) at A*STAR
Profile linkedin LinkedIn  


Contact

linkedin LinkedIn   github Github