| Semantic Structure From Motion (SSFM) --- estimating objects and cameras in a scene from images |
||
|
|
||
| What is it about? |
Semantic
Structure from Motion (SSFM) is a new framework
for jointly recognizing objects and reconstructing the
underlying 3D geometry of a scene (cameras, points and objects). SSFM
framework leverages on the intuition that measurements of
keypoints and objects must be semantically and geometrically consistent
across view points. SSFM has the unique ability to: i) enhance camera
pose estimation, compared to feature-point-based SFM algorithms; ii)
improve object detections given multiple uncalibrated images, compared
to independently detecting objects in single images. iii)
estimate camera poses from object detections only.![]() Check out our paper for details! |
Update Dec 20, 2011
|
| Who might be
interested? |
Researchers who
are familiar with the keywords -- Structure From Motion, Object
Detection, Scene Understanding -- might find SSFM useful and a
state-of-the-art baseline to compare with. |
|
| Papers and Citations |
download pdf and bibtex download pdf and bibtex. Winner of the best student paper award |
|
| About authors |
Sid Yingze Bao is
a PhD student building an automatic system that
can interpret a scene's 3D structure and understand the scene
component's semantical label. |
|
| Results | Below
are three youtube videos highlighting SSFM. We show input images, the
limitations of single image object recognition approach, and the final
result of SSFM (improved 2D object recognition and reconstructed 3D
scene). For technical details, please refer to our
paper. Video credits: Mohit Bagra |
|
| Soure
Code |
|
|
| Dataset |
|
|