Identifies objects in images or video and localizes each one, typically with a class label and a bounding box. Fundamental to surveillance, robotics, and autonomous vehicles.
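
To make "identifies and localizes" concrete, a detector's output can be pictured as a list of labeled boxes, one per object found. The sketch below is illustrative only: the `Detection` class, its field names, the `(x_min, y_min, x_max, y_max)` pixel convention, the `keep_confident` helper, and the sample values are all assumptions made for this example, not the output format of any particular library or model.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Detection:
    """One detected object (hypothetical structure, for illustration)."""
    label: str                               # what the object is (identification)
    score: float                             # confidence in [0, 1]
    box: Tuple[float, float, float, float]   # where it is: x_min, y_min, x_max, y_max in pixels

    def area(self) -> float:
        """Box area in square pixels; degenerate boxes give 0."""
        x_min, y_min, x_max, y_max = self.box
        return max(0.0, x_max - x_min) * max(0.0, y_max - y_min)


def keep_confident(detections: List[Detection], threshold: float = 0.5) -> List[Detection]:
    """Keep only detections whose score meets the threshold."""
    return [d for d in detections if d.score >= threshold]


# Hand-written sample values, not real model output.
sample = [
    Detection("car", 0.92, (34.0, 50.0, 210.0, 180.0)),
    Detection("pedestrian", 0.31, (220.0, 60.0, 260.0, 170.0)),
]

confident = keep_confident(sample)
for d in confident:
    print(d.label, d.score, d.box, d.area())
```

For video, the same structure would usually be produced once per frame; the 0.5 threshold here is an arbitrary choice for the sketch.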