Developing Data Processing Pipeline using Apache Spark and Scala to process Credit Card Requests in batches with joining data from other sources and APIs and make the final output available for Card Embossing Process.
Integrating Streaming Data Pipeline using Apache Kafka as an alternative option to batch processing in various data pipelines.
Technologies and Tools: Scala, Java, Spark, Kafka, Git, AWS EC2, AWS S3, Linux, Spring Boot
June 2019-Oct 2019
Hello Nesh Inc.(Nesh)
Data Scientist - NLP & AI
Built Conversational AI agent in Python for Oil and Gas domain.
Developed Knowledge Extraction pipeline for public documents with Semantic Extraction using NER, Constituent Parsing with Rasa NLU & Spacy and built a Knowledge Graph using Dgraph, GraphQL to represent the extracted knowledge.
Extracted topics from public documents using Gensim & TextRazor and linked them to entities in Knowledge Graph.
Conceptualized and built PoC of Diagnostic Analysis and Predictive Analytics Engine for Oil Well Failure using E-M, MLE and MAP.
Trained and deployed text classifiers with Word Embeddings, LSTM, BERT using TensorFlow, Keras on AWS Sagemaker.
June 2018-May 2019
Kelley School of Business, Indiana University Bloomington
Graduate Research Assistant
Working under Prof. Matthew Josefy utilizing ML and NLP for research on Strategy and Entrepreneurship.
Research and implement NLP methods to extract relevant information from SEC Filings.
Develop and implement models using ML and NLP to analyze business model and board leadership structure of companies.
Built Text Classifiers using NLTK and SciKit-Learn with around 90% accuracy measured with 10-Fold Cross Validation.
Technologies and Tools: Python, Java, NumPy, SciKit-Learn, Pandas, Stanford CoreNLP, Spacy, Git, TensorFlow, Keras, BeautifulSoup, Windows
Feb 2018-May 2019
Ariadata Inc. (Aridat)
Chief NLP Research Engineer
Build an analytics engine to determine critical reception of an artists work based on chatter on social media.
Built Sentiment Classifier using Naїve Bayes and Multiclass Logistic Regression to classify tweets from artists as +1, 0 and -1 and implemented metrics to analyze sentiment distribution over different demographics.
Leading and advising on the research and implementations of advanced NLP methods to improve the efficiency of the Sentiment Classifier and add new functionalities to improvise the analytics provided to the artists.
Technologies and Tools: Python, Java, Numpy, Scikit-Learn, Pandas, Stanford CoreNLP, NLTK, Afinn, TextBlob, MongoDB, Git, Matplotlib, Plotly, TensorFlow, Keras, Linux
June 2014-July 2017
Vitruvian Technologies Pvt. Ltd. (RealtyRedefined)
Senior Developer & Team Mentor
Developed functionalities for web-based ERP and CRM systems in the domain of Real Estate.
Collaborated in a team of 12 for project development including Java Programming, Data Structure & Database Design, Web Design & Development and Unit Testing.
Lead, trained and mentored a sub-team of 5 throughout the development of the projects. Contributed towards the Core Framework, proprietarily used by the firm for project development.
June 2013-May 2014
Co-founder & Developer
Developed web portals and mobile apps for small and medium scale enterprises.
Built server software for TCP Layer Protocols customized for cloud-based industrial requirements.
Developed standalone and distributed softwares for parts of manufacturing production lines.
Mentored and trained groups of 3-4 undergraduate interns for developing industry level projects.
June 2013-May 2014
Research Innovation Incubation Design Laboratory (Riidl)
Software Engineer Intern
Designed the data structures & schema and managed the database transactions using MySQL.
Built mobile app for the ERP using Java Android SDK for Android phones and Java ME for Java-based feature phones.
Deployed the ERP on a hosting service using cPanel.