
文生视频模型Sora的时间性结构分析——对生成式人工智能的现象学思考

近日,OpenAI推出了代表了目前文生视频最高水平的模型Sora,成为生成式人工智能发展史上的里程碑.然而,Sora还是存在着一些技术上的缺陷和不足.从时间现象学角度看,Sora外在时间结构"阵容"残缺,只有客观时间,没有主观时间和内在时间意识,导致其无法描述人类的心理时间,不能解释事件的因果关系和建构复杂有意义的事件及情节.此外,滞留和前摄的缺席,导致其无法连接动作和结果;缺少内在时间性动态生成结构的介入,Sora亦难以展现随着时间推移而发生的事件.因此,从技术层面增加数据模型的意向性实践和提升意向性设计的算量、算法,完善内外两个时间性结构,成为提升Sora现实表现的关键.
The Temporal Structure of Text-to-Video Model Sora: A Phenomenological Reflection on Generative Artificial Intelligence
Recently, OpenAI launched Sora, a model that represents the current state of the art in text-to-video generation and marks a milestone in the development of generative artificial intelligence. However, Sora still has technical flaws and shortcomings. From the perspective of the phenomenology of time, Sora's external temporal structure is incomplete: it has only objective time and lacks subjective time and inner time-consciousness. As a result, it cannot depict human psychological time, explain causal relationships between events, or construct complex, meaningful events and plots. Moreover, the absence of retention and protention prevents it from linking actions with their outcomes. Without the involvement of an internal, dynamically generative temporal structure, Sora also struggles to show events that unfold over time. Therefore, at the technical level, addressing the model's intentional design and completing both the internal and external temporal structures are key to improving Sora's real-world performance.

text-to-video; Sora; temporal structure; generative artificial intelligence; phenomenology; retention and protention

Deng Zhiwen (邓志文)


School of Humanities and Media, Hubei University of Science and Technology, Xianning, Hubei 437100

文生视频 Sora 时间性结构 生成式人工智能 现象学 滞留与前摄

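Funding: 2023 Ministry of Education Humanities and Social Sciences Planning Fund Project (23YJAZH023); Hubei University of Science and Technology Research Innovation Team Project (2022T06)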

2024

Editorial Friend (编辑之友)
Shanxi Publishing and Media Group Co., Ltd.


CSSCI; CHSSCD; Peking University Core Journals (北大核心)
Impact factor: 0.9
ISSN:1003-6687
Year, volume (issue): 2024, (6)