-
May 2023-Present
Comcast
Machine Learning Engineer
Developed workflows using Databricks components such as Workflows, Unity Catalog, Delta Tables, Feature Store and MLflow as building blocks for pipelines with stages for feature engineering, training, validation, scoring & publishing of Machine Learning Models.
Used Terraform as IaC to deploy Databricks infrastructure resources to different environments in the CI/CD pipeline.
Built CI/CD pipelines for ML code using Concourse and designed a standard workflow that is now used across the entire organization for all ML projects.
Deployed a Model Endpoint, using MLflow and Docker to extract the model from the Databricks Model Registry and Terraform to deploy it to AWS ECS.
Technologies and Tools: Python, Databricks, Spark, MLflow, AWS S3, AWS Athena, AWS Glue, AWS ECS, Terraform, Docker, Git, Linux
-
Nov 2021-April 2023
Audible - An Amazon Company
SDE - Machine Learning Engineering
Developed workflows using AWS components such as SageMaker, Lambda, Batch, Glue, EMR & Step Functions as building blocks for pipelines with stages for training, validation, scoring, publishing, load testing & deployment to production of Machine Learning Models.
Worked on the React Frontend and the FastAPI & AWS DynamoDB Backend of the platform that hosts Machine Learning pipelines built from the stages mentioned above.
Used AWS Cloud Development Kit (CDK) to deploy and manage Cloud Resources on AWS.
Worked with Data Scientists to make model code production ready and deploy it as containers to the AWS ECR repository for AWS SageMaker to run Training and Inference.
Overall, my work on the Machine Learning Platform reduced the engineering effort required from Data Science by 75%.
Collaborated with Data Scientists to develop and deploy multiple Machine Learning Models including Predictive Analytics, Text Classification, Anomaly Detection and Reinforcement Learning.
Technologies and Tools: Scala, Python, Java, Spark, AWS S3, AWS EMR, AWS Lambda, AWS ECR, AWS Batch, AWS SageMaker, AWS Step Functions, AWS CDK, Git, Linux
-
Jan 2020-Oct 2021
Capital One
Data Engineer
Developed a Data Processing Pipeline using Apache Spark and Scala that processes Credit Card Requests in batches, joins them with data from other sources and APIs, and makes the final output available to the Card Embossing Process. The pipeline will replace the existing mainframe system, making the process 70% more efficient.
Integrated a Streaming Data Pipeline using Apache Kafka as an alternative to batch processing in various data pipelines.
Built data pipelines for data transfer and warehousing using Enterprise File Gateway, Snowflake, Databricks and Apache Spark, making incoming data from external sources available to analytics use cases such as Anti-Money Laundering and Fraud Detection using Anomaly Detection.
Built serverless functions in Python to spin up a transient AWS EMR cluster to run the data pipelines.
Used AWS Lambda, AWS EMR, AWS CloudFormation and AWS S3 for production deployment.
Technologies and Tools: Scala, Java, Spark, Kafka, Git, AWS EC2, AWS S3, Linux, Spring Boot
-
June 2019-Oct 2019
Hello Nesh Inc. (Nesh)
Data Scientist - NLP & AI
Built a Conversational AI agent in Python for the Oil and Gas domain.
Developed a Knowledge Extraction pipeline for public documents with Semantic Extraction using NER and Constituency Parsing with Rasa NLU & spaCy, and built a Knowledge Graph using Dgraph and GraphQL to represent the extracted knowledge.
Extracted topics from public documents using Gensim & TextRazor and linked them to entities in Knowledge Graph.
Conceptualized and built a PoC of a Diagnostic Analysis and Predictive Analytics Engine for Oil Well Failure using E-M, MLE and MAP.
Trained and deployed text classifiers with Word Embeddings, LSTM and BERT using TensorFlow and Keras on AWS SageMaker.
Technologies and Tools: Python, Java, Rasa, TensorFlow, Keras, SciKit-Learn, BERT, Stanford CoreNLP, spaCy, TextRazor, Gensim, Dash by Plotly, NumPy, SciPy, Pandas, Flask, Dgraph, Docker, Kubernetes, Nginx, Git, AWS EC2, AWS S3, AWS RDS, AWS Lambda, AWS SageMaker, AWS DynamoDB, Gremlin, JanusGraph, Linux, Javascript, NodeJS
-
June 2018-May 2019
Kelley School of Business, Indiana University Bloomington
Graduate Research Assistant
Worked under Prof. Matthew Josefy, applying ML and NLP to research on Strategy and Entrepreneurship.
Researched and implemented NLP methods to extract relevant information from SEC Filings.
Developed and implemented models using ML and NLP to analyze the business models and board leadership structures of companies.
Built Text Classifiers using NLTK and SciKit-Learn with around 90% accuracy measured with 10-Fold Cross Validation.
Technologies and Tools: Python, Java, NumPy, SciKit-Learn, Pandas, Stanford CoreNLP, spaCy, Git, TensorFlow, Keras, BeautifulSoup, Windows
-
Feb 2018-May 2019
Ariadata Inc. (Aridat)
Chief NLP Research Engineer
Built an analytics engine to determine the critical reception of an artist's work based on chatter on social media.
Built a Sentiment Classifier using Naïve Bayes and Multiclass Logistic Regression to classify tweets from artists as +1, 0 and -1 and implemented metrics to analyze sentiment distribution over different demographics.
Led and advised on the research and implementation of advanced NLP methods to improve the efficiency of the Sentiment Classifier and add new functionality to improve the analytics provided to the artists.
Technologies and Tools: Python, Java, NumPy, Scikit-Learn, Pandas, Stanford CoreNLP, NLTK, Afinn, TextBlob, MongoDB, Git, Matplotlib, Plotly, TensorFlow, Keras, Linux
-
June 2014-July 2017
Vitruvian Technologies Pvt. Ltd. (RealtyRedefined)
Senior Developer & Team Mentor
Developed functionalities for web-based ERP and CRM systems in the domain of Real Estate.
Collaborated in a team of 12 for project development including Java Programming, Data Structure & Database Design, Web Design & Development and Unit Testing.
Led, trained and mentored a sub-team of 5 throughout the development of the projects. Contributed to the firm's proprietary Core Framework used for project development.
Technologies and Tools: Java, Scala, Groovy, Spring Framework, Hibernate ORM, MySQL, Apache Solr, ElasticSearch, HTML 5, CSS 3, Javascript, AngularJS, UnderscoreJS, Bootstrap, AJAX, Jquery, PHP, Laravel Framework, Play Framework, Git, SVN, AWS EC2, AWS S3, AWS RDS, AWS Route53, AWS Cloud CDN, Linux
-
June 2013-May 2014
Algonation
Co-founder & Developer
Developed web portals and mobile apps for small and medium scale enterprises.
Built server software for TCP Layer Protocols customized for cloud-based industrial requirements.
Developed standalone and distributed software for parts of manufacturing production lines.
Mentored and trained groups of 3-4 undergraduate interns in developing industry-level projects.
Technologies and Tools: Java, PHP, HTML 5, CSS 3, Bootstrap, Javascript, AJAX, Jquery, RabbitMQ, Netty, JavaFX, Openfire, Smack, XMPP, MySQL, Linux, Windows, Google Cloud Services
-
June 2013-May 2014
Research Innovation Incubation Design Laboratory (Riidl)
Software Engineer Intern
Developed a web-based ERP application for educational institutes using HTML, CSS, Bootstrap, Javascript and PHP.
Designed the data structures & schema and managed database transactions using MySQL.
Built a mobile app for the ERP using the Java Android SDK for Android phones and Java ME for Java-based feature phones.
Deployed the ERP on a hosting service using cPanel.
Technologies and Tools: Java, PHP, HTML, CSS, Bootstrap, Javascript, MySQL, Java ME, Android SDK, cPanel