HARITHA L

Chennai

Summary

Senior Software Engineer with expertise in solution design for image processing and text extraction, leveraging deep learning and heuristic approaches to drive significant business improvements. Proven track record of implementing machine learning solutions in the banking sector, consistently delivering high-quality results within deadlines. Advocates for humanistic management techniques that enhance employee morale while optimizing production efficiency. Focused on balancing technical innovation with team well-being to foster long-term organizational success.

Overview

6
years of professional experience

Work History

Senior Software Engineer

RedBlackTree
Chennai
02.2023 - 06.2023
  • Built a Google tool-based solution using NLP techniques to extract data from bills and payment slips issued by various companies and convert it into a generalized format.
  • Integrated the solution with the backend framework to ensure end-to-end functionality.
  • Independently designed and built the tool within a four-month timeframe.

Deep Learning Specialist

STANDARD CHARTERED GLOBAL BUSINESS SERVICES
Chennai
01.2020 - 11.2022
  • Developed a robust AI and ML image-processing engine to streamline operational efficiency.
  • Employed advanced stacked generalization of deep neural networks for enhanced document classification.
  • Created a sophisticated deep neural network pipeline for accurate detection, localization, extraction, and verification of document content.
  • Established a logger framework and feedback loop for the gradual enhancement of deep learning models.
  • Designed an architecture leveraging deep neural networks and heuristics for efficient content extraction from tabular formats, converting data into actionable key-value pairs.
  • Assessed the success of key programs, provided governance oversight, and presented impactful updates to leadership, maintaining stakeholder engagement.
  • Honored with several awards for outstanding contributions to design and innovative problem-solving.
  • Skill set: Deep neural net models, ML algorithms, Python, NLP

Assistant Production Manager

TRITAN LEATHERS PVT LTD.
Chennai
05.2017 - 05.2019
  • Determined project objectives, budgets, and schedules by coordinating with clients and teammates, and optimized plans to meet changing conditions.
  • Maintained privacy and confidentiality of all information for existing and prospective clients to protect personal and business interests.

Education

Post Graduate - Data Science Engineering

Great Lakes
12.2019

B.Tech - Leather Technology

Anna University
04.2017

Skills

  • Statistics
  • Regression Analysis
  • Classification
  • Ensemble Learning
  • Cluster Analysis
  • Deep Neural Networks
  • NLP, BERT, LNM
  • AWS
  • Python, SQL
  • OpenCV, PyTorch, TensorFlow
  • Django

Projects

  • Document profiling

An application to extract text from images (TIFF, JPG, PNG) and from documents (PDF, OXPS). The tool handles image pre-processing tasks such as orientation and skew correction, which in turn improve data extraction. It is built generically and is independent of any document type or template. Data is extracted from various bank documents, and entity extraction is then performed based on the business requirements. Depending on the problem statement, the tool has been extended with additional features, including support for transactional payment documents (debit and credit), payslips, and tax documents.

Skill set: Pytesseract, CTPN, Python, OpenCV, EasyOCR, NLP, NER, and GIT

  • Data extraction from cheques

Extracts critical fields (payee name, payee address, bank name, cheque date, cheque amount, routing number, cheque number, and account number) using HTR and Tesseract for OCR, together with a deep learning-based object detection algorithm that detects the relevant regions of the cheque.

Skill set: Pytesseract, CNN, FRCNN, HTR, Tesseract, TensorFlow, and GIT 

  • Data extraction with UI development

This is a React tool that uses simple Python packages to extract data from documents uploaded through the UI, followed by a data-refining step that saves the output in a unified format, as per business requirements. The tool has API and UI components for user-friendly access.

Skill set: Tabula, Python, Django, GIT
