Discussion about this post

Boyuan Xiao

"It is lumpy, dependent on whether the new data happens to contain novel physical phenomena or merely more instances of phenomena the model has already seen."

I'm not sure I see how this differs from scaling language data: there are plenty of texts that say the same thing.

To me, the biggest limitations of large-scale video data are:

- It's 2D: there aren't many multi-camera captures with calibration data

- We can't capture forces, so we're limited to kinematics rather than understanding dynamics

- Frame rates are too low in most videos, e.g. we see aliasing on wheels at 30 fps
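
The frame-rate point can be made concrete with a rough sketch of temporal aliasing (the "wagon-wheel" effect). The function name and the idealized model (identical, evenly spaced spokes and an instantaneous shutter) are assumptions for illustration, not something measured from real footage:

```python
def apparent_rev_per_s(true_rev_per_s: float, fps: float, n_spokes: int = 1) -> float:
    """Apparent rotation rate of a spoked wheel as seen by a camera.

    Assumes an idealized wheel with n_spokes identical, evenly spaced spokes
    and an instantaneous shutter (no motion blur).
    """
    # The spoke pattern repeats n_spokes times per revolution.
    pattern_hz = true_rev_per_s * n_spokes
    # Sampling at fps folds the pattern frequency into [-fps/2, fps/2).
    aliased_hz = (pattern_hz + fps / 2) % fps - fps / 2
    # Convert the aliased pattern frequency back to revolutions per second.
    return aliased_hz / n_spokes


# Slow rotation is captured faithfully at 30 fps.
print(apparent_rev_per_s(10.0, 30.0))
# At 29 rev/s the wheel appears to turn slowly backwards.
print(apparent_rev_per_s(29.0, 30.0))
# Five spokes at 6 rev/s look stationary: each frame shows the same pattern.
print(apparent_rev_per_s(6.0, 30.0, n_spokes=5))
```

Under this model, anything above fps / (2 * n_spokes) revolutions per second is misrepresented, so a multi-spoke car wheel at road speed is well past what 30 fps can resolve.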
