Markerless tracking and pose estimation on the web browser with single camera

afasew22 · June 14, 2024, 2:42am

Hello everyone,

We’re in the process of developing a web application that displays an MJPEG stream. Our goal is to identify specific objects in the video in real-time and perform pose estimation. Ideally, we’d like it to function in a way similar to fiduciary markers, but instead, leverage a machine learning algorithm to enable markerless tracking.

Could anyone share insights on whether this is feasible using TensorFlow.js?

Niloy_Deb_Barma · June 18, 2024, 2:35am

Yes, markerless tracking and pose estimation with a single camera in a web browser using TensorFlow.js is feasible. Use models like PoseNet or MoveNet for pose estimation and Coco SSD for object detection. Implement by:

Including TensorFlow.js in your project.
Loading the appropriate model.
Accessing the camera with getUserMedia.
Processing video frames through the model.
Rendering results on a canvas over the video stream.

These models are efficient and can run in real-time directly in the browser.

Jason · June 18, 2024, 9:27am

+1 to what @Niloy_Deb_Barma said. Just following up with a few links to consider:

Custom object detection blogs:

Learn how to make a smart camera in TensorFlow.js using COCO-SSD pretrained model:

Video version of codelab:

MoveNet pose estimation:

BlazePose GHUM 3D pose estimation:

Hope that helps!

Learn more about using TensorFlow.js via my course: