Dr. Rishabh Upadhyay

is looking for freelance projects. 🔎

Until 2017, Information Technology, University of Mumbai

Milan, Italy

About me

I am a Data Scientist focused on Machine Learning and a Ph.D. scholar in Information Retrieval and NLP. I have designed and trained production-ready speech recognition models for conversational speech in low-resource languages, enabling efficient communication and accessibility. Beyond speech recognition, I have worked on projects such as health information retrieval and misinformation detection. I am proficient in Python and experienced with tools such as PyTorch, Scikit-learn, and Transformers. In addition to my technical abilities, I bring strong analytical thinking, problem-solving skills, and a commitment to continuous learning. I am open to new opportunities and am seeking a role in which I can apply my skills and knowledge to drive data-informed decision-making, solve complex problems, and contribute to the success of an organization.

Skills and expertise

Python
Machine Learning
Neural Networks
Speech Recognition
Data Analysis
Data Science
Natural language processing
Transformers
Deep learning
Artificial intelligence
Information retrieval
ML

Career

Professional experience of Rishabh Upadhyay

  • 4 months, Mar. 2023 - June 2023

    Visiting Researcher

    Queen Mary, University of London

    Developed and improved a health information retrieval model using NLP techniques such as Natural Language Inference (NLI). Combined cosine similarity with an NLI-based stance detection model to compute a 'genuineness score' that assesses the credibility of health information, resulting in significant retrieval accuracy improvements. Worked extensively with Python, PyTorch, Transformers, BioBERT, and SciFive for NLI and biomedical text mining tasks.

  • 3 months, Oct. 2022 - Dec. 2022

    Visiting Researcher

    The Open University

    Augmented an existing health information retrieval model, focusing on increasing its transparency through explainability features. Integrated a comprehensive database of medical journal articles to provide detailed explanations, in the form of supporting evidence, for retrieved health information. Used Python, PyTorch, Transformers, and BioBERT for biomedical text mining tasks.

  • 3 years, Aug. 2018 - July 2021

    Senior Data Scientist

    PAX

    Developed and trained specialized Speech-To-Text (STT) models for ten low-resource languages, using techniques such as transfer learning and language-specific data augmentation. Created a scalable speech processing pipeline capable of handling over 1 billion audio recordings daily, leveraging cloud infrastructure and parallel processing. Used Python (Pandas, Scikit-learn), Git, and Kaldi for data analysis, machine learning, system operations, and speech recognition tasks.

  • 3 months, May 2016 - July 2016

    Research Assistant

    Ben-Gurion University

    Developed pcStream2, a variant of pcStream for data stream clustering that uses windowing and persistence techniques along with a novel incremental PCA (IPCA) algorithm called Just-In-Time PCA (JIT-PCA). Contributed to a research paper on the project's outcomes published at the 32nd ACM/SIGAPP Symposium On Applied Computing (SAC). Used R on Ubuntu Linux for algorithm development, data manipulation, and evaluation.

  • 5 months, Dec. 2015 - Apr. 2016

    Student Researcher

    Hosei University

    Implemented an algorithm that extracts knowledge from academic documents using Natural Language Processing techniques. The knowledge extraction algorithm led to publications at the Federated Conference on Computer Science and Information Systems (FedCSIS) and the Portland International Conference on Management of Engineering and Technology (PICMET).

Education of Rishabh Upadhyay

  • 3 years and 8 months to date, since Nov. 2020

    Computer Science

    University of Milano Bicocca

    - Funded by the DoSSIER H2020 ITN (Domain Specific Systems for Information Extraction and Retrieval).
    - Will be working on assessing the credibility, value, and relevance of health-related content.

  • 2 years, Aug. 2017 - July 2019

    Computer Science

    Innopolis University

    The program consists of one year of theoretical study and one year of practical work. Recent courses: Finite Optimisation, Advanced Machine Learning, Randomised Algorithms.

  • 4 years and 1 month, June 2013 - June 2017

    Information Technology

    University of Mumbai

    Machine Learning, Data Mining, Intelligent Systems

Languages

  • English

    Fluent

  • Hindi

    Native language

  • Italian

    Basic

Interests

Dance
Travel
Sports
