Data Science Intern @ CODMAV Research Centre (PES University)

Sat, 01 Jun 2024 00:00:00 +0000

PES University · Jan 2026 – May 2026

Overview

During my tenure at the Centre of Data Modelling, Analytics and Visualization (CODMAV), I worked at the intersection of healthcare and Artificial Intelligence. My primary objective was to build a robust predictive system capable of identifying lung cancer risk at an early stage, which is critical for patient survival rates.

The Technical Challenge

The core difficulty of this project lay in the sheer scale and sparsity of the raw clinical data. Sourced from the Harvard Dataverse (Lung Cancer Risk Prediction Dataset) , the initial dataset was massive but significantly noisy, comprising 22,811 patient records and 788 health markers.

Machine Learning on Sumukh Acharya

Data Science Intern @ CODMAV Research Centre (PES University)

Overview

The Technical Challenge