オープンソースのSora類似のビデオ生成モデル

このプロジェクトは北京大学と兎展AIGC合同ラボによって共同で立ち上げられ、OpenAIのテキストベース動画生成モデルであるSoraを再現することを目指しています。オープンソースコミュニティからの貢献を期待しており、Apache-2.0ライセンスを使用しています。

デモはこちらで使用できます：https://huggingface.co/spaces/LanguageBind/Open-Sora-Plan-v1.1.0

画像生成には通常50ステップが必要であり、動画生成には良好な結果を得るためには150ステップが必要となる場合がありますが、これは3〜4分かかることがあります。したがって、2秒の動画の生成プロセスも遅いです。

prompt: Extreme close-up of chicken and green pepper kebabs grilling on a barbeque with flames. Shallow focus and light smoke. vivid colours

prompt: A robot dog trots down a deserted alley at night, its metallic paws clinking softly on the cobblestones, the glow of its LED eyes piercing the darkness. Occasionally, it pauses to scan its surroundings with a soft, whirring sound.