Computer Vision & Age Detection
Python · VGG16 · Transfer Learning · PCA · K-Means · SVM · PyTorch · Keras
A 19% accuracy improvement, not from a better model, but from rethinking how the problem was framed. Restructuring the data proved more powerful than any architecture change.
The Question
Age detection from facial images is harder than it looks. Genetic diversity, lighting, and environmental variance all introduce noise, but the biggest obstacle was structural. The original model used 11 granular age classes, creating too many decision boundaries for the model to learn reliably across a dataset of 20,000 images. Rather than tuning the model, the breakthrough came from redesigning the data: 11 classes were reorganized into 5 demographic clusters using K-Means, dramatically reducing classification ambiguity. VGG16 Transfer Learning with PCA-optimized feature extraction was then applied to the restructured dataset. Can a data design decision outperform a model architecture decision?
