Using photorealistic imagery (captured via photogrammetry, it is clear) also brings a level of diversity and realism that is obvious in retrospect. Sure, all beds look roughly the same, but what about unmade beds? All different!
Having objects that also animate to do their “main thing” if you will is also helpful. Knowing what a refrigerator, cabinet, book, laptop or garage door look like closed is one thing and open is another, but how does it get from A to B? It sounds simplistic but if AI models aren’t provided this information, they aren’t likely to invent or intuit it.
You can read more about the characteristics and details of this huge dataset