Summary
Overview
Work History
Education
Skills
Additional Information
Accomplishments
Languages
Certification
Websites
Timeline
Generic

Prathiksha Ramesh

Coimbatore

Summary

Well-qualified Bioinformatician and Computational Biologist, knowledgeable in NGS data analysis, including bulk RNA-seq, single-cell RNA-seq, and downstream analysis. Successful in developing and optimizing bioinformatics pipelines using key tools like Kallisto, FastQC, MultiQC, Seurat, edgeR, and Bioconductor. Skilled in conducting advanced data analysis, machine learning, and statistical modeling to investigate complex biological datasets. Extensive experience in developing AI-driven chatbots using generative AI and NLP techniques. Over 2 years of research and practical experience in computational biology, data science, and genomics, including international research exposure as a DAAD scholar.

Overview

2
2
years of professional experience
1
1
Certificate

Work History

Remote Research Trainee in Computational Biology

University of Pennsylvania
06.2024 - 09.2024

Overview:
As a Remote Research Trainee at the University of Pennsylvania, I focused on gaining specialized knowledge in RNA sequencing (RNA-Seq) analysis. The training involved hands-on experience with advanced bioinformatics tools and methodologies to analyze large-scale RNA-seq datasets, improving my understanding of gene expression and transcriptomics.

Key Learnings:

  • Acquired expertise in RNA sequencing analysis, including bulk RNA-seq and single-cell RNA-seq, utilizing bioinformatics tools such as Kallisto, DESeq2, and Seurat.
  • Developed skills in preprocessing RNA-seq data, including quality control using FastQC and MultiQC, as well as transcript quantification and differential gene expression analysis.
  • Gained practical experience in using R and Python for RNA-seq data manipulation and visualization, enhancing my ability to interpret complex gene expression data.
  • Collaborated with remote mentors and peers, learning to apply cutting-edge methodologies to gene expression studies and integrating results with functional genomics analysis.

Skills, Tools, and Libraries Gained:

  • RNA-Seq Tools: Kallisto, DESeq2, Seurat, FastQC, MultiQC
  • Programming Languages: Python, R
  • RNA-Seq Analysis: Differential Gene Expression, Single-cell RNA-seq, Gene Expression Profiling
  • Data Visualization: ggplot2, Tidyverse, Matplotlib
  • Other Tools: Jupyter Notebook, R Studio, GitHub
  • Libraries: NumPy, Pandas, Scikit-learn, TensorFlow, PyTorch, Bioconductor

Computational Biology Researcher (Master's Thesis)

RWTH Aachen University
09.2023 - 03.2024

Project Overview:
As part of a prestigious DAAD scholarship, I developed my own research proposal to tackle the challenge of missing data in gene expression datasets, which was selected for execution at RWTH Aachen University. The research focused on solving the bioinformatics problem of missing data imputation in large-scale gene expression datasets, leveraging machine learning (ML) and deep learning (DL) techniques.

Key Contributions:

  • Gathered and organized data for research purposes, designing and developing ML and DL models to handle missing data in gene expression datasets.
  • Conducted thorough literature reviews to identify gaps in existing methodologies and inform future research directions.
  • Collaborated with interdisciplinary teams, including experts in computational biology, machine learning, and data science, to execute comprehensive studies and generate valuable insights.
  • Applied ethical considerations throughout all stages of the research process, ensuring data integrity and safeguarding participant welfare.
  • Successfully developed and optimized imputation methods, leading to more reliable and accurate gene expression analyses.
  • The project significantly enhanced the reliability of biological data analysis and contributed to more robust gene expression insights.
Skills Gained:
  • Computational Biology: Developed a deep understanding of gene expression datasets, particularly the challenges posed by missing data, and how to leverage computational techniques to address these issues.
  • Machine Learning & Deep Learning: Designed and implemented ML and DL models for data imputation, enhancing the accuracy of gene expression analyses.
  • Data Imputation Techniques: Acquired proficiency in various data imputation methods, optimizing approaches to handle missing data in large-scale genomic datasets.
  • Research & Analysis: Conducted extensive literature reviews to identify methodological gaps and used this information to inform future research directions.
  • Interdisciplinary Collaboration: Worked with experts from computational biology, machine learning, and data science, improving teamwork and knowledge integration across domains.
  • Problem Solving: Developed and optimized imputation models, solving the critical problem of missing data in gene expression datasets, leading to more accurate and reliable results.
  • Technical Skills: Gained experience with bioinformatics tools and libraries (e.g., Python, R, ML/DL frameworks) and advanced data science techniques for analyzing biological datasets.

Teaching Assistant (During Master's Program)

Indian Institute of Technology Gandhinagar
07.2022 - 08.2023

Overview:
While pursuing my master’s degree, I served as a Teaching Assistant at IIT Gandhinagar, where I supported over 50 students in courses such as Journal Writing, Energy & Matter Laboratory, and World of Engineering. My role involved enhancing students' research and data analysis skills, as well as leading them through practical engineering projects.

Key Contributions:

  • Assisted students in technical writing and critical thinking.
  • Led students through hands-on engineering projects, contributing to improved academic performance and successful project completions.
  • Provided academic support in laboratory sessions, helping students apply complex concepts in practical settings.
  • Collaborated with faculty to design and evaluate coursework and assessments, ensuring alignment with course objectives.

Skills Gained:

  • Research & Analysis: Helped students refine their research methods and data interpretation.
  • Project Management: Led students through project development and execution, managing timelines and outcomes.
  • Mentorship: Offered personalized academic guidance and feedback to improve student performance.
  • Collaboration: Worked with faculty to ensure the seamless execution of course content and objectives.

Education

Master of Technology(MTech) - Biological Engineering

Indian Institute of Technology.Gandhinagar
Gandhina, India
06.2024

Bachelor of Technology(BTech) - Industrial Biotechnology

Government College of Technology(GCT)
India
05.2022

Skills

  • Python (Advanced), R (Beginner), SQL (Beginner), Shell/Bash Scripting (Intermediate)
  • NGS Data Analysis: Expertise in bulk RNA-seq, single-cell RNA-seq, and downstream analyses
  • Downstream Analysis:Differential Gene Expression: DESeq2, edgeR, limma
    Functional Enrichment Analysis: GSEA, fgsea, ClusterProfiler
    Module/Cluster Identification: Seurat, Scater/Scran
    Tools: Kallisto, FastQC, MultiQC, EnsemblDB
  • Single-cell RNA-seq: Analysis, integration, and gene expression profiling
  • Bioinformatics Pipelines: Design and implementation for large-scale genomic data
  • Data Imputation: Handling missing data in gene expression datasets
  • Gene Expression Data Analysis: Using Bioconductor packages and associated tools in R
  • Data Management: Data Cleaning, Data Preprocessing, Feature Engineering, EDA
  • Model Development: Supervised Learning (Regression, Classification), Unsupervised Learning (Dimensionality Reduction, Clustering)
  • Advanced Techniques: Hyperparameter Tuning, Cross-Validation, Model Evaluation Metrics
  • Ensemble Learning: Random Forest, Gradient Boosting, XGBoost
  • Machine Learning & Deep Learning: Experience in building ML/DL models for biological data
  • Tools: NumPy, Pandas, Matplotlib, Seaborn, Scikit-learn, TensorFlow, PyTorch, Keras
  • Genomic Data Science: Handling and analyzing large-scale biological datasets using R and Python
  • Functional Genomics: Pathway enrichment, ontology analysis, network analysis
  • Visualization: ggplot2, plotly, RColorBrewer, gplots
  • Chatbot Development: Extensive experience in building AI-driven chatbots using generative AI and NLP techniques
  • Natural Language Processing (NLP): Transformer-based models and fine-tuning LLMs for conversational systems
  • Tools: LangChain, Hugging Face, OpenAI Models, RAG, Gemini, Mistral, Phi3
  • IDE/Platforms: VS Code, Jupyter Notebook, Google Colab, R Studio
  • Cloud & DevOps: Docker, Git & GitHub, AWS Beanstalk, EC2 Instance, Streamlit Cloud, HPC, AWS Cloud
  • Databases: AstraDB, MySQL, Pinecone
  • Linux, MacOS, Windows
  • Techniques: Molecular Docking, Molecular Dynamics Simulation, Free Energy Calculations
  • Tools: Vina, NAMD, GROMACS
  • Communication: English Proficiency, Scientific Written Communication, Research Proposal Writing, PowerPoint Presentation, Poster Presentation
  • Collaboration: International Collaboration, Teamwork, Adaptability to Emerging Technologies
  • Problem Solving & Critical Thinking: Analytical Problem Solving, Scientific Thinking
  • Time Management & Adaptability: Quick Adaptation to New Tools and Technologies, Efficient Project Management
  • Curiosity: Driven by Scientific Curiosity and Lifelong Learning

Additional Information

1. In-Silico Docking Studies of Selected Phytochemicals Against Papain-Like Protease of SARS-CoV-2 • Type: Research Article • Contribution: Co-authored the paper during my undergraduate studies, where I performed the molecular dynamic simulations that supported the docking studies. • Status: Published

2. Basepairs in a Pocket: Human Revolution Towards Precision Medicine • Type: Perspective Article • Contribution: First author of this paper, currently under preparation. The article explores the role of precision medicine in the future of healthcare, emphasizing the importance of genetic and genomic information in treatment personalization. • Status: Under Preparation

3. Benchmarking Existing Imputation Methods for Handling Missing Data in Machine Learning Gene Expression Pipeline • Type: Master's Research Paper • Contribution: First author of the paper, co-authored with my German supervisor and Indian supervisor. The research benchmarks various imputation methods for managing missing data in gene expression datasets within machine learning pipelines. • Status: Under Preparation



Accomplishments

  • School First in 10th Grade: Achieved the highest marks, including a perfect score in Mathematics.
  • School First in 12th Grade: Secured the top position in my school with the highest overall marks.
  • Merit Scholarship: Awarded a fully-funded merit seat based on academic excellence for pre-university performance.
  • GATE Scholarship: Cleared the GATE exam (top 0.2%), securing admission to IIT Gandhinagar with an MHRD-funded stipend.
  • DAAD Scholarship: Selected as one of 70 students across 23 IITs to pursue a fully-funded master’s thesis in Germany.
  • Best Poster Award: First prize for a poster presentation at the PG Research Showcase, IIT Gandhinagar.
  • Special Mention by Director: Recognized during the convocation speech for DAAD Scholarship and research excellence.

Languages

English
Advanced (C1)
Tamil
Bilingual or Proficient (C2)
kannada
Upper intermediate (B2)
German
Beginner (A1)

Certification

  • Certified Professional in Advanced Programming and Masters in Data Science – GUVI & IIT Madras Research Park (Nov 2022 - May 2023)
  • Genomic Data Science Specialization – Johns Hopkins University (Completed 2024)
  • Complete Machine Learning, NLP Bootcamp & Deployment – Udemy (Completed 2024)
  • Complete Generative AI Course with LangChain and Hugging Face – (Completed 2024)
  • Teaching Assistant Certificate – IIT Gandhinagar (Awarded 2024)

Timeline

Remote Research Trainee in Computational Biology

University of Pennsylvania
06.2024 - 09.2024

Computational Biology Researcher (Master's Thesis)

RWTH Aachen University
09.2023 - 03.2024

Teaching Assistant (During Master's Program)

Indian Institute of Technology Gandhinagar
07.2022 - 08.2023

Master of Technology(MTech) - Biological Engineering

Indian Institute of Technology.Gandhinagar

Bachelor of Technology(BTech) - Industrial Biotechnology

Government College of Technology(GCT)
Prathiksha Ramesh