文生视频模型Sora的时间性结构分析——对生成式人工智能的现象学思考

扫码查看

原文链接

万方数据

中文摘要：近日,OpenAI推出了代表了目前文生视频最高水平的模型Sora,成为生成式人工智能发展史上的里程碑.然而,Sora还是存在着一些技术上的缺陷和不足.从时间现象学角度看,Sora外在时间结构"阵容"残缺,只有客观时间,没有主观时间和内在时间意识,导致其无法描述人类的心理时间,不能解释事件的因果关系和建构复杂有意义的事件及情节.此外,滞留和前摄的缺席,导致其无法连接动作和结果;缺少内在时间性动态生成结构的介入,Sora亦难以展现随着时间推移而发生的事件.因此,从技术层面增加数据模型的意向性实践和提升意向性设计的算量、算法,完善内外两个时间性结构,成为提升Sora现实表现的关键.

外文标题：The Temporal Structure of Text-to-Video Model Sora:A Phenomenological Reflection on Generative Artificial Intelligence

外文摘要：Recently,OpenAI launched Sora,a model that represents the current pinnacle of text-to-video technology,marking a milestone in the evolution of generative artificial intelligence.However,Sora still has some technical flaws and shortcomings.From a phenomenological perspective,Sora's external temporal structure is incomplete,featuring only objective time,lacking subjective time and inner time consciousness,which prevents it from depicting human psychological time,explaining causal relationships,and constructing complex,meaningful events and plots.Moreover,the absence of retention and fore-shoot hinders its ability to link actions with outcomes.Without the intervention of the internal temporal dynamic generation structure,Sora is also difficult to show the events that occur over time.Therefore,from a technical standpoint,addressing the model's intentional design issues and enhancing both the internal and external temporal structures become the key to improving Sora's performance in reality.

外文关键词：

text-to-videoSoratemporal structuregenerative artificial intelligencephenomenologyretention and fore-shoot

作者：

邓志文

展开 >

作者单位：

湖北科技学院人文与传媒学院,湖北咸宁 437100

关键词：

文生视频 Sora 时间性结构生成式人工智能现象学滞留与前摄

基金：

2023年教育部人文社会科学规划基金项目湖北科技学院科研创新团队项目

项目编号：

23YJAZH0232022T06

出版年：

2024

DOI：

10.13786/j.cnki.cn14-1066/g2.2024.6.006

编辑之友

山西出版传媒集团有限责任公司

编辑之友

CSSCICHSSCD北大核心

影响因子：0.9

ISSN：1003-6687

年,卷(期)：2024.(6)

参考文献量6