出版科学 ›› 2026, Vol. 34 ›› Issue (2): 50-58.

• 理论探索 • 上一篇    下一篇

基于生成式人工智能的虚拟现实媒介内容生成

  

  1. (马来西亚城市大学创意产业学院,吉隆坡,46100)(中国矿业大学计算机科学与技术学院 / 人工智能学院,徐州,221116
  • 收稿日期:2025-02-10 修回日期:2026-01-12 出版日期:2026-05-08 发布日期:2026-05-08
  • 作者简介:曹苇航,马来西亚城市大学创意产业学院2023级博士生、讲师;曹天杰,工学博士,中国矿业大学计算机科学与技术学院 / 人工智能学院教授、博士生导师。
  • 基金资助:

    G237

    本文系2023年江苏省高等教育教改研究立项课题(重点)“高校教师课堂质量AI辅助管理方法研究”(2023JSJG178)研究成果。


Content Generation of Virtual Reality Media Based on Generative Artificial Intelligence

  1. Faculty of Creative Industries, City University Malaysia, Kuala Lumpur, 46100)(School of Computer Science and Technology / School of Artificial Intelligence, China University of Mining and Technology, XuZhou, 221116
  • Received:2025-02-10 Revised:2026-01-12 Online:2026-05-08 Published:2026-05-08

摘要:

探讨生成式人工智能技术驱动下的虚拟现实媒介内容生成特征与机制。指出生成式人工智能赋能下的虚拟现实媒介具备四大特征 :一是通过自动化生成技术提升内容创作效率 ;二是支持多样化的感官体验 ;三是依托用户数据实现虚拟环境的个性化适配 ;四是支持虚实交互的实时性与自然性。这些特性在工程实现中面临的三大核心需求,包括跨模态数据的高效协同生成,虚拟环境与物理世界的动态一致性保持,以及多模态信息融合的沉浸感优化。为系统性地支撑这些核心需求,构建了“跨模态转换 - 虚实映射 - 物理仿真 - 多模态信息融合”为核心的内容生成四元机制模型。最后,指出在大规模产业化应用方面,虚拟现实媒介内容生成仍面临技术、伦理、规范和用户体验等多维度的挑战。

关键词:

生成式人工智能 虚拟现实媒介 模态对齐 数字孪生 多模态信息融合

Abstract:

This article explores the content generation characteristics and mechanisms of virtual reality media VRMdriven by generative artificial intelligence GenAI. The article identifies four features of VRM empowered by GenAI firstit enhances content creation efficiency through automated generation technology; secondit supports diverse sensory experiences; thirdit leverages user data analysis to achieve personalized adaptation of virtual environments; and fourthit ensures real-time and natural interaction between the virtual and physical worlds. The article reveals three core requirements for engineering implementation efficient collaborative generation of cross-modal datadynamic consistency between virtual environments and the physical worldand optimization of immersion through multimodal information fusion. To systematically address these requirementsa content generation quadrupole mechanism model is proposedcentered on cross-modal transformation - virtual-real mapping - physical simulation - multimodal information fusion. Finally, the article notes that VRM content generation for large-scale industrial applications still faces multidimensional challenges, including technology, ethics, regulations, and user experience.

Key words: Generative artificial intelligence Virtual reality media Modality alignment Digital twins Multimodal information fusion

中图分类号: