Skip to main content
  1. Projects/

Autonomous Vision - Real-Time Object Detection

Pranav
Author
Pranav
Coding with Physics, Physics-ing with code

Overview
#

Built a convolutional model to detect vehicles and pedestrians from driving video. The goal was to build a practical detection pipeline and present the model architecture and training decisions to a technical review panel.

Role / Team / Timeline
#

Role: Model development and training Team: Inspirit AI (Stanford/MIT alumni led cohort) Timeline: 2023 (2-week program)

Technical Approach
#

  • Model: VGG-16 based CNN for classification of cars, trucks, jeeps, and pedestrians.
  • Data: Custom Kaggle video dataset with labeled classes.
  • Pipeline: Preprocessing plus edge/pattern recognition to support robustness.
  • Training: 5 hours on Google Colab T4 GPU (about 16 hours total development time).

Results
#

  • Produced a working detector that identifies moving objects in driving scenarios.
  • Presented architecture decisions and supervised learning approach to a peer review panel.

Tools / Stack
#

  • Python
  • TensorFlow / Keras
  • Google Colab

Links / Media #

  • Demo videos and model artifacts available on request.

Related