This project focused on developing a hybrid biomedical imaging framework that integrates MATLAB-based preprocessing and Python-based deep learning models to detect brain tumors from MRI scans. The primary challenge addressed was the data scarcity problem common in medical imaging, where the limited dataset size often leads to underfitting and model instability.
The dataset—sourced from Kaggle’s Brain MRI Images for Tumor Detection—contained 253 images (155 tumor, 98 non-tumor) in mixed formats (.jpg, .jpeg, .png). To enhance diversity and combat overfitting, the dataset underwent batchwise preprocessing, expanding it to 604 images through contrast adjustment, wiener filtering, and anisotropic diffusion.
The MATLAB module performed tumor detection through a sequence of steps including anisotropic diffusion filtering for edge-preserving smoothing, thresholding and morphological operations for segmentation, and bounding box generation for tumor localization. It achieved approximately 92% detection accuracy on the test subset.
The following models were implemented and evaluated using the preprocessed dataset. Each model represents a stage of improvement in both accuracy and stability.
The following models were implemented and evaluated using the preprocessed dataset. Each model represents a stage of improvement in both accuracy and stability.
The VGG16 model suffered from overfitting due to limited data diversity, achieving only 60% testing accuracy. The modified ResNet model improved testing accuracy to 72.55% by incorporating dropout and batch normalization. Finally, ResNet50, fine-tuned using transfer learning, achieved 100% training and 90.62% testing accuracy, demonstrating superior generalization performance despite a small dataset size.
• Expanded MRI dataset (253 → 604 images) through batch preprocessing for better generalization.
• Validated the importance of preprocessing in mitigating data hunger in medical AI models.
• Demonstrated effective tumor segmentation and visualization using edge-preserving filters.
• Showed that transfer learning (ResNet50) outperformed traditional CNNs for limited medical datasets.
• Established a foundational pipeline adaptable to 3D MRI and CT imaging.