Image and video generation models often miss cultural nuances and produce stereotyped or generic outputs — a problem ...