Dr. Rishabh Upadhyay
Bis 2017, Information technology, University of Mumbai
Milano, Italien
Über mich
I am a passionate Data Scientist with a focus on Machine Learning and a Ph.D. scholar in Information Retrieval and NLP. Throughout my career, I have successfully designed and trained production-ready speech recognition models for conversational speech in low resource languages, enabling efficient communication and accessibility. Beyond my expertise in speech recognition, I have worked on diverse projects such as health information retrieval and misinformation detection. Proficient in programming languages such as Python and experienced with tools like Torch, Scikit-learn, and Transformers. In addition to my technical abilities, I bring strong analytical thinking, problem-solving abilities, and a passion for continuous learning. Open to new opportunities, I am seeking a role that allows me to apply my skills and knowledge to drive data-informed decision-making, solve complex problems, and contribute to the success of an organization.
Werdegang
Berufserfahrung von Rishabh Upadhyay
4 Monate, März 2023 - Juni 2023
Visiting Researcher
Queen Mary, University of London
Developed and improved a health information retrieval model using advanced NLP techniques like Natural Language Inference (NLI). Employed cosine similarity and an NLI-based stance detection model to compute a unique 'genuineness score' that assesses the credibility of health information, resulting in significant retrieval accuracy improvements. Worked extensively with Python, PyTorch, Transformers, BioBERT, and SciFIVE for NLI and biomedical text mining tasks.
3 Monate, Okt. 2022 - Dez. 2022
Visiting Researcher
The Open University
Augmented an existing health information retrieval model, focusing on increasing its transparency through explainability features. Integrated a comprehensive database of medical journal articles, providing detailed explanations in the form of evidences for retrieved health information. Utilized Python, PyTorch, Transformers, and BioBERT for biomedical text mining tasks.
3 Jahre, Aug. 2018 - Juli 2021
Senior Data Scientist
PAX
Developed and trained specialized Speech-To-Text (STT) models for ten low-resource languages, employing advanced techniques such as transfer learning and language-specific data augmentation. Created a scalable speech processing pipeline capable of handling over 1 billion audio recordings daily, leveraging cloud infrastructure and parallel processing techniques. Used Python (Pandas, Scikit-learn),, Git, and Kaldi for data analysis, machine learning, system operations, and speech recognition tasks.
3 Monate, Mai 2016 - Juli 2016
Research assistant
Ben-Gurion University
Developed pcStream2, a variant of pcStream, employing windowing and persistence techniques along with a novel IPCA algorithm called Just-In-Time PCA (JIT-PCA) for data stream clustering. Contributed to a research paper published in the 32nd ACM/SIGAPP Symposium On Applied Computing (SAC), showcasing project outcomes and advancements. Utilized R on the Ubuntu Linux platform for algorithm development, data manipulation, and evaluation.
5 Monate, Dez. 2015 - Apr. 2016
Student Researcher
Hosei University
Implemented an advanced algorithm to extract meaningful knowledge from academic documents employing Natural Language Processing techniques. The developed knowledge extraction algorithm resulted in publications in the Federated Conference on Computer Science and Information Systems (FedCSIS) and the Portland International Conference on Management of Engineering and Technology (PICMET)
Ausbildung von Rishabh Upadhyay
Bis heute 3 Jahre und 8 Monate, seit Nov. 2020
Computer Science
University of Milano Bicocca
- DoSSIER H2020 ITN funding (Domain Specific Systems for Information Extraction and Retrieval). - Will be working on Assessing Credibility, Value, and Relevance from health related content.
2 Jahre, Aug. 2017 - Juli 2019
Computer Science
Innopolis University
This program consists of theoretical (1 year) and practical knowledge (1 year). List of Recent Courses: - Finite Optimisation. - Advance Machine Learning. - Randomised Algorithm.
4 Jahre und 1 Monat, Juni 2013 - Juni 2017
Information technology
University of Mumbai
Machine learning Data Mining Intelligence system
Sprachen
Englisch
Fließend
Hindi
Muttersprache
Italienisch
Grundlagen