CN120259461A - Image generation model training, image generation method, device, equipment and medium - Google Patents

Image generation model training, image generation method, device, equipment and medium

Info

Publication number
CN120259461A
Authority
CN
China
Prior art keywords
vector
rotation matrix
training
position rotation
sub
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202510294423.7A
Other languages
Chinese (zh)
Inventor
李永波
胡一江
杨跃
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Seashell Housing Beijing Technology Co Ltd
Original Assignee
Seashell Housing Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Seashell Housing Beijing Technology Co Ltd
Priority to CN202510294423.7A
Publication of CN120259461A
Legal status: Pending

Abstract


The disclosed embodiments relate to training of an image generation model and to an image generation method, device, equipment and medium. The method comprises: obtaining a training panoramic image sample and a description text corresponding to the training panoramic image sample and inputting them into an image generation model to be trained; performing position encoding on the input vector corresponding to the training panoramic image sample to obtain a position rotation matrix corresponding to the input vector; calculating an output vector based on the position rotation matrix corresponding to the input vector; and adjusting the model parameters of the image generation model to be trained based on the loss value between the training vector of the training panoramic image sample and the output vector to obtain the image generation model. With this technical solution, the pixel positions of the panoramic image are represented by position rotation matrices in order to train the image generation model, so that during panoramic image generation the distance information of each pixel in the generated panoramic image can be determined based on the position rotation matrix, thereby improving the generation effect of the panoramic image.

Description

Training of image generation model, image generation method, device, equipment and medium
Technical Field
The disclosure relates to the technical field of image processing, and in particular relates to training of an image generation model, an image generation method, an image generation device, image generation equipment and a medium.
Background
At present, image generation algorithms are rapidly developed and widely used in various fields.
Existing image generation algorithms apply only to the generation of perspective views and cannot handle 360-degree panoramic images. Specifically, the texture distribution of a panoramic image differs considerably from that of a perspective view, and the position encoding used for panoramic images in existing image generation algorithms cannot represent this texture distribution well, so the panoramic image generation effect is relatively poor.
Disclosure of Invention
In order to solve the above technical problems or at least partially solve the above technical problems, the present disclosure provides a training method of an image generation model, an image generation method, an image generation device and a medium.
The embodiment of the disclosure provides a training method of an image generation model, which comprises the following steps:
acquiring training data pairs, wherein the training data pairs comprise training panoramic image samples and descriptive texts corresponding to the training panoramic image samples;
inputting the training panoramic image sample and the description text into a pre-constructed image generation model to be trained, performing position encoding on an input vector corresponding to the training panoramic image sample through a preset position rotation matrix formula to obtain a position rotation matrix corresponding to the input vector, and calculating an output vector based on the position rotation matrix corresponding to the input vector;
and adjusting model parameters of the image generation model to be trained based on the loss value between the training vector of the training panoramic image sample and the output vector to obtain an image generation model.
The embodiment of the disclosure also provides an image generation method, which comprises the following steps:
acquiring an image generation description text;
Inputting the image generation description text into an image generation model, carrying out position coding on a target vector corresponding to a preset noise vector through a preset position rotation matrix formula to obtain a position rotation matrix corresponding to the target vector, calculating a generation vector based on the position rotation matrix corresponding to the target vector, and decoding the generation vector to obtain a target panoramic image;
wherein the image generation model is obtained according to the training method of the image generation model according to any one of the preceding embodiments.
The embodiment of the disclosure also provides a training device for the image generation model, which comprises:
The first acquisition module is used for acquiring training data pairs, wherein the training data pairs comprise training panoramic image samples and description texts corresponding to the training panoramic image samples;
The input module is used for inputting the training panoramic image sample and the description text into a pre-constructed image generation model to be trained, so as to perform position encoding on an input vector corresponding to the training panoramic image sample through a preset position rotation matrix formula, obtain a position rotation matrix corresponding to the input vector, and calculate an output vector based on the position rotation matrix corresponding to the input vector;
And the training module is used for adjusting the model parameters of the image generation model to be trained based on the loss value between the training vector of the training panoramic image sample and the output vector to obtain an image generation model.
The embodiment of the disclosure also provides an image generating device, which comprises:
the second acquisition module is used for acquiring the image generation description text;
The generation module is used for inputting the image generation description text into an image generation model, carrying out position coding on a target vector corresponding to a preset noise vector through a preset position rotation matrix formula to obtain a position rotation matrix corresponding to the target vector, calculating a generation vector based on the position rotation matrix corresponding to the target vector, and decoding the generation vector to obtain a target panoramic image;
wherein the image generation model is obtained according to the training method of the image generation model according to any one of the preceding embodiments.
The embodiment of the disclosure also provides an electronic device comprising a processor and a memory for storing instructions executable by the processor, wherein the processor is configured to read the executable instructions from the memory and execute them to implement the image generation model training and image generation methods provided by the embodiments of the present disclosure.
The embodiment of the present disclosure also provides a computer readable storage medium storing a computer program for executing the training of the image generation model, the image generation method as provided by the embodiment of the present disclosure.
The embodiment of the disclosure also provides a computer program product, comprising a computer program, wherein the computer program is used for executing the training and image generation method of the image generation model provided by the embodiment of the disclosure by a processor.
Compared with the prior art, the technical solution provided by the embodiments of the present disclosure has the following advantages: a training data pair is obtained, where the training data pair comprises a training panoramic image sample and a description text corresponding to the training panoramic image sample; the training panoramic image sample and the description text are input into a pre-constructed image generation model to be trained; the input vector corresponding to the training panoramic image sample is position-encoded through a preset position rotation matrix formula to obtain a position rotation matrix corresponding to the input vector; an output vector is calculated based on the position rotation matrix corresponding to the input vector; and model parameters of the image generation model to be trained are adjusted based on the loss value between the training vector of the training panoramic image sample and the output vector to obtain the image generation model. With this technical solution, the pixel positions of the panoramic image are represented by the position rotation matrix in order to train the image generation model, so that during panoramic image generation the position rotation matrix obtained from the image generation model can be used to determine the distance information of each pixel in the generated panoramic image, improving the panoramic image generation effect.
Drawings
The above and other features, advantages, and aspects of embodiments of the present disclosure will become more apparent by reference to the following detailed description when taken in conjunction with the accompanying drawings. The same or similar reference numbers will be used throughout the drawings to refer to the same or like elements. It should be understood that the figures are schematic and that elements and components are not necessarily drawn to scale.
Fig. 1 is a flowchart of a training method of an image generation model according to an embodiment of the present disclosure;
FIG. 2A is a schematic diagram of a panoramic spherical coordinate system provided by an embodiment of the present disclosure;
FIG. 2B is a view of a panorama expanded coordinate system provided by an embodiment of the present disclosure;
FIG. 3 is a schematic diagram of image generation model training provided by an embodiment of the present disclosure;
Fig. 4 is a schematic diagram of a Self-attention module provided in an embodiment of the present disclosure;
FIG. 5 is a flowchart of another training method for an image generation model according to an embodiment of the present disclosure;
FIG. 6 is a schematic diagram of an image generation provided by an embodiment of the present disclosure;
fig. 7 is a flowchart of an image generating method according to an embodiment of the present disclosure;
FIG. 8 is a schematic diagram of another image generation provided by an embodiment of the present disclosure;
Fig. 9 is a schematic structural diagram of a training device for an image generation model according to an embodiment of the present disclosure;
fig. 10 is a schematic structural diagram of an image generating apparatus according to an embodiment of the present disclosure;
Fig. 11 is a schematic structural diagram of an electronic device according to an embodiment of the disclosure.
Detailed Description
Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While certain embodiments of the present disclosure are shown in the accompanying drawings, it is to be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete. It should be understood that the drawings and embodiments of the present disclosure are for illustration purposes only and are not intended to limit the scope of the present disclosure.
It should be understood that the various steps recited in the method embodiments of the present disclosure may be performed in a different order and/or performed in parallel. Furthermore, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this respect.
The term "including" and variations thereof as used herein are intended to be open-ended, i.e., including, but not limited to. The term "based on" is based at least in part on. The term "one embodiment" means "at least one embodiment," another embodiment "means" at least one additional embodiment, "and" some embodiments "means" at least some embodiments. Related definitions of other terms will be given in the description below.
It should be noted that the terms "first," "second," and the like in this disclosure are merely used to distinguish between different devices, modules, or units and are not used to define an order or interdependence of functions performed by the devices, modules, or units.
It should be noted that references to "a", "an", and "a plurality" in this disclosure are illustrative rather than limiting; those of ordinary skill in the art will appreciate that they are to be understood as "one or more" unless the context clearly indicates otherwise.
The names of messages or information interacted between the various devices in the embodiments of the present disclosure are for illustrative purposes only and are not intended to limit the scope of such messages or information.
In general, the texture distribution of a panoramic image differs greatly from that of a perspective view. Compared with a perspective view, the texture structure of a panoramic image has the following characteristics: the aspect ratio of the image is 2:1; the horizontal field of view is 360 degrees and the vertical field of view is 180 degrees; the left and right sides of the image are continuous because the 360-degree horizontal field of view wraps around, while the upper and lower edges are discontinuous because the vertical field of view spans only 180 degrees; and, since the imaged environment is projected onto a unit sphere, structures that are straight in the horizontal direction are distorted into arcs. The position encoding used for panoramic images in existing image generation algorithms cannot represent this texture distribution well, so the panoramic image generation effect is poor.
To address these problems, the present disclosure provides a training scheme for an image generation model: a training data pair is obtained, where the training data pair comprises a training panoramic image sample and a description text corresponding to the training panoramic image sample; the training panoramic image sample and the description text are input into a pre-constructed image generation model to be trained; the input vector corresponding to the training panoramic image sample is position-encoded through a preset position rotation matrix formula to obtain a position rotation matrix corresponding to the input vector; an output vector is calculated based on the position rotation matrix corresponding to the input vector; and model parameters of the image generation model to be trained are adjusted based on the loss value between the training vector and the output vector to obtain the image generation model. With this technical solution, the pixel positions of the panoramic image are represented by the position rotation matrix in order to train the image generation model, so that during panoramic image generation the position rotation matrix obtained from the image generation model can be used to determine the distance information of each pixel in the generated panoramic image, improving the panoramic image generation effect.
Fig. 1 is a flow chart of a training method of an image generation model according to an embodiment of the present disclosure, where the method may be performed by a training apparatus of the image generation model, where the apparatus may be implemented by software and/or hardware, and may be generally integrated in an electronic device. As shown in fig. 1, the method includes:
And 101, acquiring a training data pair, wherein the training data pair comprises a training panoramic image sample and a description text corresponding to the training panoramic image sample.
The training panoramic image sample may be any panoramic image; the embodiments of the present disclosure do not limit it. For example, the training panoramic image sample may be a stitched panoramic image of multiple rooms of a show flat in a building, or a stitched panoramic image of a room in a house for sale.
The description text refers to text information describing the training panoramic image sample. The image content recognition result of the training panoramic image sample can be used as the description text: for example, recognizing a stitched room panorama may yield the result "a living room comprising a sofa and a television", which then serves as the description text of that sample. Alternatively, text information input by a user for the training panoramic image sample can be received and used as the description text.
Specifically, in the process of training the image generation model, a plurality of training data pairs can be obtained, and each training data pair comprises a training panoramic image sample and a description text corresponding to the training panoramic image sample.
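As an illustrative sketch only (the patent does not prescribe any data format), a training data pair might be represented in Python as follows; the field names are hypothetical.

```python
# A minimal sketch of one training data pair; names are illustrative,
# not from the patent.
from dataclasses import dataclass

@dataclass
class TrainingPair:
    panorama_path: str   # equirectangular panorama, aspect ratio 2:1
    description: str     # text describing the image content

pair = TrainingPair(
    panorama_path="living_room_pano.jpg",
    description="A living room with TV and sofa",
)
```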
Step 102, inputting training panoramic image samples and descriptive text into a pre-constructed image generation model to be trained, carrying out position coding on input vectors corresponding to the training panoramic image samples through a preset position rotation matrix formula, obtaining a position rotation matrix corresponding to the input vectors, and calculating output vectors based on the position rotation matrix corresponding to the input vectors.
The image generation model to be trained is a pre-constructed network architecture combining a backbone such as a Diffusion Transformer (DiT, a diffusion-based deep learning framework) or SDXL (Stable Diffusion XL, an open-source text-to-image framework) with one or more VAEs (Variational Autoencoders); the specific choice and configuration depend on the actual application scenario.
Specifically, after the training panoramic image sample and the description text are input into the pre-constructed image generation model to be trained, the training panoramic image sample is first encoded by the VAE to obtain a training vector. The training vector is the training panoramic image sample converted into a processable numerical form, i.e., features are extracted from the sample and encoded into a vector representation. The training vector and the description text are then processed by DiT or SDXL, which contains a Self-attention module; the vector input to the Self-attention module can be understood as the training vector after network encoding and other processing, and the vector processed by the Self-attention module can better learn the distance relationship between pixels.
It can be understood that, before the Self-attention module is input, the training panoramic image sample is subjected to a series of network coding to obtain an input vector proportional to the scale of the training panoramic image sample (original image), the input vector is subjected to position coding through a preset position rotation matrix formula to obtain a position rotation matrix corresponding to the input vector, and the output vector is calculated based on the position rotation matrix corresponding to the input vector.
The input vector refers to the vector input to the Self-attention module for processing, and it is proportional to the scale of the training panoramic image sample. For example, the input vector X of the Self-attention module has size (c, h, w), where c is the number of channels and h and w give the vector size, proportional to the size of the original image (the training panoramic image sample). The position rotation matrix formula refers to the calculation formula for the position rotation matrix corresponding to the input vector, and the position rotation matrix corresponding to the input vector refers to the rotation matrices of the sub-vectors of the input vector; continuing the example, these are the rotation matrices of the sub-vectors at each position (h_i, w_i) of the input vector (c, h, w).
The position rotation matrix formula is preset; specifically, each position rotation matrix parameter in the formula, and the calculation formula for the parameter value corresponding to each such parameter, are determined in advance.
In the embodiments of the present disclosure, after the input vector is acquired, it is position-encoded through the position rotation matrix formula to obtain the position rotation matrix corresponding to the input vector. As one example: the input vector corresponding to the training panoramic image sample is acquired; each sub-vector corresponding to the input vector is acquired; the pixel position information corresponding to each sub-vector is encoded according to the position rotation matrix formula; and the resulting sub-position rotation matrices are combined to obtain the position rotation matrix corresponding to the input vector. As another example: the parameter values corresponding to all position rotation matrix parameters in the formula are determined according to the pixel position coordinates of each sub-vector of the input vector under a target coordinate system, yielding a plurality of sub-position rotation matrices that are combined to obtain the position rotation matrix corresponding to the input vector. These two ways are merely examples; the present disclosure does not limit the specific implementation of position-encoding the input vector through the position rotation matrix formula.
After the input vector is position-encoded, it is input into the Self-attention module. For example, the input vector passes through linear layers to obtain a first vector, a second vector, and a third vector; a correlation calculation is performed between the first and second vectors, and a new vector is output after the third vector is applied to the correlation result. When the correlation between the first and second vectors is calculated, the position rotation matrix corresponding to the input vector allows the distance between the two vectors to be learned better, so the pixel distances between pixels in the training panoramic image sample can be learned. This improves the training effect of the image generation model and thus the generation effect of panoramic images produced by it.
It can be understood that the vector output by the Self-attention module is further processed in the DiT or SDXL network architecture to output the vector.
In the embodiments of the present disclosure, after the training panoramic image sample and its corresponding description text are obtained, they can be input into the pre-constructed image generation model to be trained, so that the input vector corresponding to the training panoramic image sample is position-encoded through the preset position rotation matrix formula, the position rotation matrix corresponding to the input vector is obtained, and the output vector is calculated based on the position rotation matrix corresponding to the input vector.
And step 103, adjusting model parameters of the image generation model to be trained based on the loss value between the training vector and the output vector of the training panoramic image sample to obtain the image generation model.
The training vector is used for converting the training panoramic image sample into a numerical form which can be processed, namely extracting features from the training panoramic image sample and encoding the features into a vector representation.
In the embodiments of the present disclosure, a training vector corresponding to the training panoramic image sample is obtained; specifically, the training panoramic image sample is encoded to obtain the training vector, and more specifically, it is encoded through a VAE (Variational Autoencoder).
In the embodiments of the present disclosure, the similarity between the training vector and the output vector can be calculated, and a loss value computed from the similarity through a preset loss function (such as a cross-entropy loss function). When the loss value is greater than or equal to a preset loss threshold, the model parameters of the image generation model to be trained are adjusted, the loss value is recomputed for a new training data pair, and the new loss value is again compared with the threshold, until the loss value is smaller than the threshold and the image generation model is obtained.
Specifically, after the output vector is obtained, model parameters of an image generation model to be trained are adjusted based on a loss value between a training vector of a training panoramic image sample and the output vector, and an image generation model is obtained.
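A minimal Python sketch of this parameter-adjustment step is given below, assuming a VAE wrapper exposing an encode() method and an MSE-style loss standing in for the unspecified loss function; all names and signatures are illustrative, not the patent's API.

```python
# A sketch of one training step under the stated assumptions; `model`
# and `vae` are hypothetical wrappers, not a specific library API.
import torch
import torch.nn.functional as F

def train_step(model, vae, image, text, optimizer):
    with torch.no_grad():
        train_vec = vae.encode(image)     # training vector (latent)
    out_vec = model(train_vec, text)      # output vector from the network
    # Loss between training vector and output vector, as described above;
    # MSE is an assumed stand-in for the preset loss function.
    loss = F.mse_loss(out_vec, train_vec)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()                      # adjust model parameters
    return loss.item()
```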
The training scheme of the image generation model provided by the embodiments of the present disclosure acquires a training data pair comprising a training panoramic image sample and a description text corresponding to the training panoramic image sample; inputs the sample and text into a pre-constructed image generation model to be trained; position-encodes the input vector corresponding to the sample through a preset position rotation matrix formula to obtain the position rotation matrix corresponding to the input vector; calculates the output vector based on that matrix; and adjusts the model parameters of the image generation model to be trained based on the loss value between the training vector of the sample and the output vector to obtain the image generation model. With this technical solution, the pixel positions of the panoramic image are represented by the position rotation matrix in order to train the image generation model, so that during panoramic image generation the position rotation matrix obtained from the image generation model can be used to determine the distance information of each pixel in the generated panoramic image, improving the panoramic image generation effect.
In some embodiments, performing position encoding on the input vector corresponding to the training panoramic image sample through the preset position rotation matrix formula to obtain the position rotation matrix corresponding to the input vector comprises: obtaining the input vector corresponding to the training panoramic image sample; obtaining each sub-vector corresponding to the input vector; encoding the pixel position information corresponding to each sub-vector according to the preset position rotation matrix formula; and combining the resulting sub-position rotation matrices to obtain the position rotation matrix corresponding to the input vector.
In the embodiments of the present disclosure, the training vector is obtained after the training panoramic image sample is encoded; the input vector is obtained after a series of network encoding steps in DiT or SDXL; the input vector is position-encoded and then input to the Self-attention module to learn the pixel position relationships between pixels in the training panoramic image sample; and the output vector is produced by further network encoding.
Specifically, each sub-vector corresponding to the input vector is obtained, pixel position information corresponding to each sub-vector is encoded according to a preset position rotation matrix formula, and a plurality of sub-position rotation matrices are obtained and combined to obtain a position rotation matrix corresponding to the input vector.
In the embodiments of the present disclosure, there are various ways to encode the pixel position information corresponding to each sub-vector according to the position rotation matrix formula, obtain a plurality of sub-position rotation matrices, and combine them into the position rotation matrix corresponding to the input vector. As one example: each position rotation matrix parameter is determined based on the position rotation matrix formula; the parameter value corresponding to each parameter is determined based on the pixel position information corresponding to each sub-vector; the parameter values are input into the position rotation matrix formula to obtain the sub-position rotation matrix corresponding to each sub-vector; and the sub-position rotation matrices are combined to obtain the position rotation matrix corresponding to the input vector.
As another example: the target pixel position coordinates of each sub-vector in a target coordinate system are determined based on the pixel position information corresponding to each sub-vector; a first position rotation matrix parameter value is determined based on preset pose parameters, the target pixel position coordinates of each sub-vector, and a preset first matrix parameter calculation formula; a second position rotation matrix parameter value is determined based on the target pixel position coordinates of each sub-vector and a preset second matrix parameter calculation formula; the first and second position rotation matrix parameter values are input into the position rotation matrix formula to obtain the sub-position rotation matrix of each sub-vector; and the sub-position rotation matrices are combined to obtain the position rotation matrix corresponding to the input vector.
In the embodiments of the present disclosure, the target coordinate system refers to a polar coordinate system. It is understood that the image pixel coordinates in the training panoramic image sample are usually given in a Cartesian coordinate system, i.e., the initial coordinate system is Cartesian, so a coordinate system conversion is required.
The method comprises the steps of obtaining initial pixel position coordinates of each sub-vector corresponding to an initial coordinate system based on pixel position information corresponding to each sub-vector, and converting the initial pixel position coordinates of each sub-vector according to a preset coordinate system conversion formula to obtain target pixel position coordinates of each sub-vector in a target coordinate system.
Specifically, the coordinate system of the training panoramic image sample is shown in FIG. 2A. Each pixel corresponds to a point on the unit sphere, which can be expressed in Cartesian coordinates as (x, y, z) with x, y, z ∈ [−1, 1], or in polar coordinates as (θ, φ), where θ ∈ [−π/2, π/2] and φ ∈ [−π, π].
Specifically, the transformation relationship between the two coordinate systems is shown in formulas (1) and (2).
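Since the bodies of formulas (1) and (2) are not reproduced in this text, the following Python sketch shows a standard equirectangular pixel-to-polar and polar-to-Cartesian conversion consistent with the stated ranges; the exact axis convention is an assumption.

```python
# A minimal sketch of the Cartesian <-> polar conversion on the unit
# sphere implied by formulas (1) and (2); axis order and signs are
# assumed, since the formula bodies are elided in the source text.
import numpy as np

def pixel_to_polar(u, v, width, height):
    """Map an equirectangular pixel (u, v) to polar angles (theta, phi)."""
    phi = (u / width) * 2.0 * np.pi - np.pi        # phi in [-pi, pi]
    theta = np.pi / 2.0 - (v / height) * np.pi     # theta in [-pi/2, pi/2]
    return theta, phi

def polar_to_cartesian(theta, phi):
    """Point on the unit sphere at latitude theta and longitude phi."""
    x = np.cos(theta) * np.cos(phi)
    y = np.cos(theta) * np.sin(phi)
    z = np.sin(theta)
    return np.array([x, y, z])
```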
In particular, the pose of an object in space can be represented by a rotation (R) and a translation (t), i.e., a transform T = [R | t], where R satisfies R R^T = I and det(R) = 1, and t is a three-dimensional translation vector.
Meanwhile, the rotation matrix can be calculated using preset pose parameters (yaw, pitch, roll). Setting yaw = φ, pitch = θ, and roll = 0 (with the same definitions as elsewhere), the rotation matrix of the position of any pixel in the training panoramic image sample can be expressed as shown in equation (3).
Further, since the panoramic image is continuous in the horizontal direction but discontinuous in the vertical direction, absolute position information in the θ direction is introduced through the translation, as shown in formula (4).
t = [θ/(π/2), θ/(π/2), θ/(π/2)]^T (4)
Therefore, the preset rotational position encoding formula is shown as formula (5).
Wherein the first position rotation matrix parameter R is from equation (3) and the second position rotation matrix parameter t is from equation (4).
Therefore, the first position rotation matrix parameter value and the second position rotation matrix parameter value can be determined based on the pixel position information corresponding to each sub-vector, so as to obtain a sub-position rotation matrix of each sub-vector, and finally, a plurality of sub-position rotation matrices are combined to obtain a position rotation matrix corresponding to the input vector.
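The following Python sketch assembles a sub-position rotation matrix in the spirit of equations (3)-(5), assuming yaw = φ, pitch = θ, roll = 0 with a ZYX rotation order, and assuming R and t are combined into a single 4×4 matrix (consistent with the later remark that the channel count must be a multiple of 4 so that R_pano can act on 4-dimensional sub-vectors). These conventions are assumptions where the patent elides the formula bodies.

```python
# A sketch of equations (3)-(5) under the stated assumptions; the ZYX
# rotation order and the 4x4 assembly are assumed conventions.
import numpy as np

def rotation_from_angles(theta, phi):
    """Rotation matrix for yaw=phi, pitch=theta, roll=0 (ZYX order assumed)."""
    cy, sy = np.cos(phi), np.sin(phi)      # yaw about z
    cp, sp = np.cos(theta), np.sin(theta)  # pitch about y
    Rz = np.array([[cy, -sy, 0], [sy, cy, 0], [0, 0, 1]])
    Ry = np.array([[cp, 0, sp], [0, 1, 0], [-sp, 0, cp]])
    return Rz @ Ry                         # roll = 0, so no third factor

def position_rotation_matrix(theta, phi):
    """4x4 R_pano combining the rotation (3) and the translation (4)."""
    R = rotation_from_angles(theta, phi)
    t = np.full(3, theta / (np.pi / 2.0))  # absolute position along theta
    R_pano = np.eye(4)
    R_pano[:3, :3] = R
    R_pano[:3, 3] = t
    return R_pano
```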
It can be understood that, after the position rotation matrix corresponding to the input vector is obtained, subsequent correlation calculations through this matrix allow the pixel distance between pixels in the training panoramic image sample to be learned. Because all pixels of the panoramic image are distributed on one unit sphere, a complete representation of the panoramic pixel space is achieved (each point has exactly one corresponding rotation matrix), so the distance between two pixel points can be represented by the included angle between the two view directions; the angle calculation formula is shown as formula (6).
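Since the body of formula (6) is not reproduced here, the sketch below uses the standard geodesic angle between two rotation matrices, arccos((tr(R1^T R2) − 1)/2), as a plausible reading; this is an assumption.

```python
# A sketch of the included-angle computation referenced as formula (6);
# the trace form is the standard angle between two rotations and is an
# assumption, since the source elides the exact expression.
import numpy as np

def angle_between(R1, R2):
    """Geodesic angle between two 3x3 rotation matrices."""
    cos_angle = (np.trace(R1.T @ R2) - 1.0) / 2.0
    return np.arccos(np.clip(cos_angle, -1.0, 1.0))
```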
In the scheme, the position rotation matrix of each sub-vector corresponding to the input vector can be obtained, so that the pixel distance between each pixel point in the training panoramic image sample can be obtained in subsequent processing, the correlation between each pixel point in the training panoramic image sample can be quickly and effectively learned, the trained image generation model can determine the distance information of each pixel point in the generated panoramic image based on the position rotation matrix in the panoramic image generation process, and the panoramic image generation effect is improved.
Based on the description of the embodiment, the pre-constructed image generation model to be trained can be trained through training data, and the input vector corresponding to the training panoramic image sample is subjected to position coding through a preset position rotation matrix formula in the training process, so that the image generation model is obtained, and the generation effect of generating the panoramic image is improved.
Specifically, as shown in fig. 3, the pre-constructed image generation model to be trained is built on a Diffusion Transformer (DiT) framework. In the training process, the input image is first encoded by the VAE to obtain an image latent vector; the latent vector and the description text, such as "A living room with TV and sofa", are then input into DiT for the noise-adding diffusion process shown in fig. 3; finally an output latent vector is obtained, which can be decoded by the VAE decoder into an image similar to the original. However, such pipelines are designed for perspective views and cannot learn the texture distortion characteristics of panoramic images, resulting in poor panoramic image generation.
Specifically, the Self-attention module is widely used in image and language models, is one of the key modules of a diffusion network, and is applied in text-to-image frameworks (such as DiT, SDXL, etc.). As shown in fig. 4, the input vector X passes through linear layers to obtain three vectors {Q, K, V}; Q and K undergo a correlation calculation consisting of the matmul (matrix multiplication), scale, and softmax (normalized exponential) operations shown in fig. 4, and the correlation result is applied to V by a further matmul to obtain the output. The Self-attention module thus realizes autocorrelation between elements of the input sequence; its formula is shown as formula (7):
Attention(Q, K, V) = softmax(Q K^T / sqrt(d)) V (7)
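A minimal single-head Python sketch of formula (7), the standard scaled dot-product attention; shapes and names are illustrative.

```python
# A sketch of formula (7) with single-head shapes (L, d) for brevity.
import torch
import torch.nn.functional as F

def self_attention(x, w_q, w_k, w_v):
    """x: (L, c); w_q/w_k/w_v: (c, d) linear-layer weights."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v       # linear layers
    scores = q @ k.T / (q.shape[-1] ** 0.5)   # matmul + scale
    return F.softmax(scores, dim=-1) @ v      # softmax + matmul
```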
In the embodiment of the disclosure, the pixel point positions of the panoramic image are represented through the position rotation matrix to train the image generation model, so that the distance information of each pixel point in the panoramic image can be determined and generated based on the position rotation matrix in the panoramic image generation process, and the panoramic image generation effect is improved.
Specifically, rotary position embedding (Rotary Position Embedding, RoPE) can be applied in an image generation model, e.g., as position encoding for a one-dimensional sequence. It extrapolates well and converges faster with respect to sequence length, i.e., it achieves the effect of relative position encoding through an absolute position encoding scheme.
As shown in equation (8), the correlation between Q and K is obtained by an inner product calculation, which RoPE improves by designing a function f such that the final result is equivalent to relative position encoding.
⟨f(Q_m, m), f(K_n, n)⟩ = g(Q_m, K_n, m − n) (8)
where Q and K correspond to Q and K in the Self-attention module, m and n denote positions, and m, n ∈ [0, L−1] assuming Q and K have length L. That is, by designing a suitable function f, the same inner product calculation equivalently realizes g(Q_m, K_n, m − n), which depends on the relative position (m − n).
In the related RoPE coding mode, f is designed as shown in formula (9).
where α is a preset fixed parameter. The encoding f is realized by splitting Q and K into channel pairs and applying a 2×2 rotation matrix to each pair; by derivation, the inner product calculation of formula (9) can be converted into formula (10).
(R_m Q_m)^T (R_n K_n) = Q_m^T R_m^T R_n K_n = Q_m^T R_(m−n) K_n (10)
i.e., g(Q_m, K_n, m − n) = Q_m^T R_(m−n) K_n.
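For reference, a Python sketch of the standard 1D RoPE rotation described by formulas (9) and (10); the frequency schedule base^(−2j/d) is the usual choice and is assumed here.

```python
# A sketch of standard 1D RoPE: channels are paired and each pair is
# rotated by an angle that grows with the token position m.
import torch

def rope_1d(x, positions, base=10000.0):
    """x: (L, d) with d even; positions: (L,) integer token positions."""
    L, d = x.shape
    freqs = base ** (-torch.arange(0, d, 2, dtype=torch.float32) / d)
    angles = positions[:, None].float() * freqs[None, :]   # (L, d/2)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[:, 0::2], x[:, 1::2]
    out = torch.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin   # 2x2 rotation per channel pair
    out[:, 1::2] = x1 * sin + x2 * cos
    return out
```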
When RoPE is used for an image task, the horizontal and vertical dimensions of the image are decoupled and each is encoded one-dimensionally. This works for ordinary perspective images, but the horizontal and vertical dimensions of a panoramic image are not continuous in real space in this way, so they cannot be well expressed by this method.
In the training process of the image generation model in the embodiments of the present disclosure, the texture structure of the panoramic image can be effectively expressed through three-dimensional rotary position encoding, which realizes the effects of both relative and absolute position encoding in the form of absolute position encoding. It can be conveniently applied to generation models (such as DiT, SDXL, etc.) to realize panoramic image generation, as described in detail with reference to FIG. 5.
Fig. 5 is a flowchart of another training method for an image generation model according to an embodiment of the present disclosure, where the training method for an image generation model is further optimized based on the foregoing embodiment. As shown in fig. 5, the method includes:
Step 201, acquiring a training data pair, wherein the training data pair comprises a training panoramic image sample and a description text corresponding to the training panoramic image sample.
It should be noted that, step 201 is the same as step 101, and detailed description of step 101 is specifically referred to, and will not be described in detail herein.
Step 202, inputting training panoramic image samples and description texts into a pre-constructed image generation model to be trained so as to obtain input vectors corresponding to the training panoramic image samples and obtain each sub-vector corresponding to the input vectors.
Step 203, coding pixel position information corresponding to each sub-vector according to a position rotation matrix formula, obtaining a plurality of sub-position rotation matrices, combining to obtain a position rotation matrix corresponding to the input vector, and calculating an output vector based on the position rotation matrix corresponding to the input vector.
In the embodiments of the present disclosure, encoding the pixel position information corresponding to each sub-vector according to the position rotation matrix formula and combining the resulting sub-position rotation matrices into the position rotation matrix corresponding to the input vector comprises: determining the target pixel position coordinates of each sub-vector in a target coordinate system based on the pixel position information corresponding to each sub-vector; determining a first position rotation matrix parameter value based on preset pose parameters, the target pixel position coordinates of each sub-vector, and a preset first matrix parameter calculation formula; determining a second position rotation matrix parameter value based on the target pixel position coordinates of each sub-vector and a preset second matrix parameter calculation formula; inputting the first and second position rotation matrix parameter values into the position rotation matrix formula to obtain the sub-position rotation matrix of each sub-vector; and combining the sub-position rotation matrices to obtain the position rotation matrix corresponding to the input vector.
In the embodiments of the present disclosure, determining the target pixel position coordinates of each sub-vector in the target coordinate system based on the pixel position information corresponding to each sub-vector comprises: acquiring the initial pixel position coordinates of each sub-vector in the initial coordinate system based on the pixel position information corresponding to each sub-vector; and converting the initial pixel position coordinates of each sub-vector according to a preset coordinate system conversion formula to obtain the target pixel position coordinates of each sub-vector in the target coordinate system.
Specifically, before the Self-attention module, the input image undergoes a series of network encodings to obtain an input vector X proportional to the original image scale. Suppose X has size (c, h, w), with (h, w) = (H/N, W/N) for an original image of size (H, W) and N a rational number; X is then aligned with the original image in the width and height dimensions, so it can be encoded per pixel of the panoramic image. That is, for the sub-vector X_i ∈ R^(c×1) at any position (h_i, w_i) of X, the position is encoded as the rotation matrix R_pano_i given by formula (5). Referring to the Self-attention structure of FIG. 4, the corresponding {Q_i, K_i} are obtained and regrouped into blocks of four channels; since the channel number c of the input vector X is a multiple of 4, R_pano_i can be applied to {Q_i, K_i}, as shown in equation (11).
f(Q_i, i) = R_pano_i Q_i (11)
Thus, the pixel positions of the panoramic image are represented in the form of rotation matrices, and the image generation model is subsequently trained on this basis, realizing an improved panoramic image generation effect. A sketch of how equation (11) might be applied in practice is shown below.
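The following Python sketch assumes Q is flattened to (h·w, c) and the channels are regrouped into consecutive blocks of 4; the regrouping convention is not specified in this text and is an assumption.

```python
# A sketch of equation (11): each 4-channel block of Q_i is multiplied
# by the per-pixel 4x4 R_pano matrix. The block grouping is assumed.
import numpy as np

def apply_pano_rope(q, r_pano):
    """q: (h*w, c) with c % 4 == 0; r_pano: (h*w, 4, 4) per-pixel matrices."""
    n, c = q.shape
    blocks = q.reshape(n, c // 4, 4)                     # (n, c/4, 4)
    rotated = np.einsum("nij,nkj->nki", r_pano, blocks)  # R_pano @ block
    return rotated.reshape(n, c)
```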
And 204, adjusting model parameters of the image generation model to be trained based on the loss value between the training vector of the training panoramic image sample and the output vector to obtain the image generation model.
It should be noted that, step 204 is the same as step 103, and specific reference is made to the detailed description of step 103, which is not described in detail herein.
Specifically, an image generation model to be trained is built according to the network parameter design; the input vector of each Self-attention module is obtained; the rotation matrix R_pano of each position of the input vector is calculated based on formula (5); during training and inference, R_pano is applied to {Q, K} of the Self-attention module in the RoPE encoding manner; and the network is trained and run according to the original model architecture to obtain the image generation model.
The training scheme of the image generation model provided by the embodiments of the present disclosure obtains a training data pair comprising a training panoramic image sample and a corresponding description text; inputs the sample and text into a pre-constructed image generation model to be trained to obtain the input vector corresponding to the sample and each of its sub-vectors; encodes the pixel position information corresponding to each sub-vector according to the position rotation matrix formula, obtaining sub-position rotation matrices that are combined into the position rotation matrix corresponding to the input vector; calculates the output vector based on that matrix; and adjusts the model parameters of the image generation model to be trained based on the loss value between the training vector of the sample and the output vector, obtaining the image generation model. With this technical solution, the pixel position of every pixel of the panoramic image is represented by a position rotation matrix, so the pixel distances between pixels in the panoramic image can be accurately determined to train the image generation model; during panoramic image generation, the position rotation matrix obtained from the image generation model determines the distance information of each pixel, improving the panoramic image generation effect.
Based on the description of the foregoing embodiment, after the image generation model is obtained, image generation may be performed with it. As illustrated in fig. 6, the image latent vector is replaced by a noise vector (the noise shown in fig. 6), and an image conforming to the text description "A living room with TV and sofa" is obtained through the DiT network and the VAE decoder, as described in detail below in connection with fig. 7.
Fig. 7 is a flowchart of an image generating method according to an embodiment of the present disclosure, where the method may be performed by an image generating apparatus, and the apparatus may be implemented by using software and/or hardware, and may be generally integrated in an electronic device. As shown in fig. 7, the method includes:
Step 301, acquiring an image to generate descriptive text.
Step 302, inputting the image generation description text into an image generation model, so as to perform position coding on a target vector corresponding to a preset noise vector through a preset position rotation matrix formula, obtain a position rotation matrix corresponding to the target vector, calculate a generated vector based on the position rotation matrix corresponding to the target vector, and decode the generated vector to obtain the target panoramic image.
In the embodiments of the present disclosure, the image generation model is acquired by the training method of the image generation model described in the foregoing embodiments.
In the embodiments of the present disclosure, the image generation description text, such as "A living room with TV and sofa", is input according to the actual generation requirement, and the relevant noise vector is preset.
Specifically, performing position encoding on the target vector corresponding to the preset noise vector through the preset position rotation matrix formula to obtain the position rotation matrix corresponding to the target vector comprises: obtaining the target vector corresponding to the noise vector; obtaining each sub-target vector corresponding to the target vector; encoding the pixel position information corresponding to each sub-target vector according to the position rotation matrix formula; and combining the resulting sub-target position rotation matrices to obtain the position rotation matrix corresponding to the target vector.
More specifically: the target pixel position coordinates of each sub-target vector in the target coordinate system are determined based on the pixel position information corresponding to each sub-target vector; a first position rotation matrix parameter value is determined based on the preset pose parameters, the target pixel position coordinates of each sub-target vector, and the preset first matrix parameter calculation formula; a second position rotation matrix parameter value is determined based on the target pixel position coordinates of each sub-target vector and the preset second matrix parameter calculation formula; the first and second position rotation matrix parameter values are input into the position rotation matrix formula to obtain the sub-target position rotation matrix of each sub-target vector; and the sub-target position rotation matrices are combined to obtain the position rotation matrix corresponding to the target vector.
In some embodiments, the method further comprises: calculating a target angle based on the sub-target position rotation matrices corresponding to any two sub-target vectors and a preset angle calculation formula; determining pixel distance information between the two sub-target vectors based on the target angle; and determining the pixel position relationship of the target panoramic image based on the pixel distance information.
The angle calculation formula is formula (6): inputting the sub-target position rotation matrices corresponding to any two sub-target vectors into formula (6) yields a target angle, which represents the distance between the two pixel points. The pixel distance information between any two sub-target vectors can therefore be determined, and the pixel position relationship of the target panoramic image determined from it. It can be understood that a panoramic image can be rotated arbitrarily in the left-right direction but has absolute positions at the south and north poles; absolute position information is therefore added into the position rotation matrix through the second position rotation matrix parameter, finally realizing a relative position effect horizontally and an absolute position effect vertically.
Specifically, in the image generation process, the pixel distance between each pixel point in the generated target panoramic image can be accurately determined through the encoding mode of the position rotation matrix, so that the pixel position relationship between each pixel point in the target panoramic image can be determined, and the panoramic image generation effect is improved.
As an example, as shown in fig. 8, the image generation description text "A living room with TV and sofa" is input; the noise vector is processed by DiT; the target vector corresponding to the noise vector is position-encoded by the position encoding manner of the present disclosure to obtain the position rotation matrix corresponding to the target vector; the generation vector is calculated based on that matrix; and the generation vector is decoded to obtain the target panoramic image.
According to the image generation scheme provided by the embodiment of the disclosure, the image generation description text is acquired, the image generation description text is input into the image generation model, the target vector corresponding to the preset noise vector is subjected to position coding through the preset position rotation matrix formula, the position rotation matrix corresponding to the target vector is obtained, the generated vector is calculated based on the position rotation matrix corresponding to the target vector, and the generated vector is decoded, so that the target panoramic image is obtained. In this way, in the panoramic image generation process, the target panoramic image is generated by encoding according to the position rotation matrix encoding method, and the panoramic image generation effect is improved.
Fig. 9 is a schematic structural diagram of an image generation model training apparatus according to an embodiment of the present disclosure, where the apparatus may be implemented by software and/or hardware, and may be generally integrated in an electronic device. As shown in fig. 9, the apparatus includes:
a first obtaining module 401, configured to obtain a training data pair, where the training data pair includes a training panoramic image sample and a description text corresponding to the training panoramic image sample;
An input module 402, configured to input the training panoramic image sample and the description text into a pre-constructed image generation model to be trained, so as to perform position encoding on an input vector corresponding to the training panoramic image sample through a preset position rotation matrix formula, obtain a position rotation matrix corresponding to the input vector, and calculate an output vector based on the position rotation matrix corresponding to the input vector;
And the training module 403 is configured to adjust model parameters of the image generation model to be trained based on a loss value between the training vector of the training panoramic image sample and the output vector, so as to obtain an image generation model.
Optionally, the input module 402 includes:
The first acquisition unit is used for inputting the training panoramic image sample and the description text into a pre-constructed image generation model to be trained so as to acquire an input vector corresponding to the training panoramic image sample;
The second acquisition unit is used for acquiring each sub-vector corresponding to the input vector;
The encoding unit is used for encoding the pixel position information corresponding to each sub-vector according to the position rotation matrix formula, obtaining a plurality of sub-position rotation matrices, and combining them to obtain the position rotation matrix corresponding to the input vector.
Optionally, the coding unit is specifically configured to:
Determining target pixel position coordinates of each sub-vector in a target coordinate system based on the pixel position information corresponding to each sub-vector;
Determining a first position rotation matrix parameter value based on a preset attitude parameter, the target pixel position coordinates of each sub-vector, and a preset first matrix parameter calculation formula;
Determining a second position rotation matrix parameter value based on the target pixel position coordinates of each sub-vector and a preset second matrix parameter calculation formula;
Inputting the first position rotation matrix parameter value and the second position rotation matrix parameter value into the position rotation matrix formula to obtain a sub-position rotation matrix of each sub-vector, and combining the plurality of sub-position rotation matrices to obtain a position rotation matrix corresponding to the input vector.
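The first and second matrix parameter calculation formulas are not reproduced here. As a hedged illustration of the combination step just listed, the sketch below builds one 2x2 rotation block per sub-vector from two assumed parameter values and assembles the blocks into a block-diagonal position rotation matrix; the additive combination of the two parameters is likewise an assumption:

```python
import numpy as np

def sub_rotation_block(alpha: float) -> np.ndarray:
    """2x2 rotation block for one sub-vector; alpha stands in for the combined
    first and second position rotation matrix parameter values."""
    c, s = np.cos(alpha), np.sin(alpha)
    return np.array([[c, -s], [s, c]])

def combine_position_rotation_matrix(first_params, second_params) -> np.ndarray:
    """Combine per-sub-vector rotation blocks into one block-diagonal matrix.

    first_params / second_params are hypothetical per-sub-vector values
    produced by the first and second matrix parameter calculation formulas.
    """
    blocks = [sub_rotation_block(a + b) for a, b in zip(first_params, second_params)]
    n = 2 * len(blocks)
    R = np.zeros((n, n))
    for i, block in enumerate(blocks):  # place each 2x2 block on the diagonal
        R[2 * i:2 * i + 2, 2 * i:2 * i + 2] = block
    return R

# Hypothetical usage with three sub-vectors
R = combine_position_rotation_matrix([0.1, 0.2, 0.3], [0.0, 0.5, 1.0])
print(R.shape)  # (6, 6)
```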
Optionally, the determining, based on the pixel position information corresponding to each sub-vector, the target pixel position coordinate of each sub-vector in the target coordinate system includes:
Acquiring initial pixel position coordinates of each sub-vector in an initial coordinate system based on the pixel position information corresponding to each sub-vector;
And converting the initial pixel position coordinates of each sub-vector according to a preset coordinate system conversion formula to obtain the target pixel position coordinates of each sub-vector in the target coordinate system.
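The preset coordinate system conversion formula itself is not reproduced in this section; the sketch below assumes the standard equirectangular convention for panoramas, mapping a pixel (u, v) to longitude and latitude on the sphere, which matches the horizontally relative, vertically absolute behaviour described above:

```python
import numpy as np

def pixel_to_sphere(u: int, v: int, width: int, height: int):
    """Map an equirectangular pixel (u, v) to (longitude, latitude) in radians.

    Assumes the standard equirectangular layout: longitude spans [-pi, pi]
    from left to right and wraps around, latitude spans [pi/2, -pi/2] from
    top (north pole) to bottom (south pole). This stands in for the preset
    coordinate system conversion formula.
    """
    lon = (u + 0.5) / width * 2.0 * np.pi - np.pi   # horizontal: relative, periodic
    lat = np.pi / 2.0 - (v + 0.5) / height * np.pi  # vertical: absolute, poles fixed
    return lon, lat

# The center pixel of a 1024x512 panorama maps approximately to (0, 0):
print(pixel_to_sphere(511, 255, 1024, 512))
```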
The training device for the image generation model provided by the embodiment of the disclosure can execute the training method for the image generation model provided by any embodiment of the disclosure, and has the corresponding functional modules and beneficial effects of the execution method.
Fig. 10 is a schematic structural diagram of an image generating apparatus according to an embodiment of the present disclosure, where the apparatus may be implemented by software and/or hardware, and may be generally integrated in an electronic device. As shown in fig. 10, the apparatus includes:
A second obtaining module 501, configured to obtain an image generation description text;
The generating module 502 is configured to input the image generation description text into an image generation model, so as to perform position encoding on the target vector corresponding to a preset noise vector through a preset position rotation matrix formula, obtain the position rotation matrix corresponding to the target vector, calculate a generated vector based on that position rotation matrix, and decode the generated vector to obtain a target panoramic image;
The image generation model is obtained according to the training method of the image generation model in the previous embodiment.
Optionally, the generating module 502 is specifically configured to:
Inputting the image generation description text into an image generation model to obtain a target vector corresponding to the noise vector;
each sub-target vector corresponding to the target vector is obtained;
And encoding the pixel position information corresponding to each sub-target vector according to the position rotation matrix formula, obtaining a plurality of sub-target position rotation matrices, and combining them to obtain the position rotation matrix corresponding to the target vector.
Optionally, the apparatus further includes:
The calculation module is used for calculating based on the sub-target position rotation matrices corresponding to any two sub-target vectors and a preset angle calculation formula, to obtain a target angle;
The first determining module is used for determining pixel distance information between any two sub-target vectors based on the target angle;
and the second determining module is used for determining the pixel position relation of the target panoramic image based on the pixel distance information.
The image generating device provided by the embodiment of the disclosure can execute the image generating method provided by any embodiment of the disclosure, and has the corresponding functional modules and beneficial effects of the executing method.
Embodiments of the present disclosure also provide a computer program product comprising a computer program/instructions which, when executed by a processor, implement the training method of the image generation model provided by any embodiment of the present disclosure.
Fig. 11 is a schematic structural diagram of an electronic device according to an embodiment of the disclosure. Referring now in particular to fig. 11, a schematic diagram of an electronic device 600 suitable for use in implementing embodiments of the present disclosure is shown. The electronic device 600 in the embodiments of the present disclosure may include, but is not limited to, mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), car terminals (e.g., car navigation terminals), and the like, and stationary terminals such as digital TVs, desktop computers, and the like. The electronic device shown in fig. 11 is merely an example, and should not impose any limitations on the functionality and scope of use of embodiments of the present disclosure.
As shown in fig. 11, the electronic device 600 may include a processing means (e.g., a central processing unit, a graphic processor, etc.) 601, which may perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 602 or a program loaded from a storage means 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data required for the operation of the electronic apparatus 600 are also stored. The processing device 601, the ROM 602, and the RAM 603 are connected to each other through a bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
In general, the following devices may be connected to the I/O interface 605: input devices 606 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; output devices 607 including, for example, a liquid crystal display (LCD), speaker, vibrator, etc.; storage devices 608 including, for example, a magnetic tape, hard disk, etc.; and a communication device 609. The communication device 609 may allow the electronic device 600 to communicate wirelessly or by wire with other devices to exchange data. While fig. 11 shows an electronic device 600 having various means, it is to be understood that not all of the illustrated means are required to be implemented or provided; more or fewer means may alternatively be implemented or provided.
In particular, according to embodiments of the present disclosure, the processes described above with reference to flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program embodied on a non-transitory computer readable medium, the computer program containing program code for performing the method shown in the flowchart. In such an embodiment, the computer program may be downloaded and installed from a network via communication means 609, or from storage means 608, or from ROM 602. When executed by the processing means 601, the computer program performs the functions defined above in the training method of the image generation model of the embodiment of the present disclosure.
It should be noted that the computer readable medium described in the present disclosure may be a computer readable signal medium or a computer readable storage medium, or any combination of the two. The computer readable storage medium can be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples of a computer-readable storage medium may include, but are not limited to, an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this disclosure, a computer-readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In the present disclosure, however, the computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, with the computer-readable program code embodied therein. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination of the foregoing. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to electrical wiring, fiber optic cable, RF (radio frequency), and the like, or any suitable combination of the foregoing.
In some embodiments, the clients and servers may communicate using any currently known or future developed network protocol, such as HTTP (HyperText Transfer Protocol), and may be interconnected with any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include a local area network ("LAN"), a wide area network ("WAN"), an internetwork (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently known or future developed network.
The computer readable medium may be included in the electronic device or may exist alone without being incorporated into the electronic device.
The computer readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: acquire a training panoramic image sample and the description text corresponding to the training panoramic image sample and input them into an image generation model to be trained; perform position encoding on the input vector corresponding to the training panoramic image sample to obtain the position rotation matrix corresponding to the input vector; calculate an output vector based on the position rotation matrix corresponding to the input vector; and adjust model parameters of the image generation model to be trained based on the loss value between the training vector of the training panoramic image sample and the output vector, to obtain the image generation model.
Computer program code for carrying out operations of the present disclosure may be written in one or more programming languages or combinations thereof, including, but not limited to, object oriented programming languages such as Java, Smalltalk and C++, as well as conventional procedural programming languages such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units involved in the embodiments of the present disclosure may be implemented by means of software, or may be implemented by means of hardware. The names of the units do not, in some cases, constitute a limitation on the units themselves.
The functions described above herein may be performed, at least in part, by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on a Chip (SOCs), Complex Programmable Logic Devices (CPLDs), and the like.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. The machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
According to one or more embodiments of the present disclosure, the present disclosure provides an electronic device comprising:
A processor;
A memory for storing the processor-executable instructions;
the processor is configured to read the executable instructions from the memory and execute the instructions to implement the training method of the image generation model provided by any embodiment of the present disclosure.
According to one or more embodiments of the present disclosure, the present disclosure provides a computer-readable storage medium storing a computer program for performing the training method of the image generation model provided by any embodiment of the present disclosure.
The foregoing description is only of the preferred embodiments of the present disclosure and an explanation of the technical principles employed. It will be appreciated by persons skilled in the art that the scope of the disclosure is not limited to technical solutions formed by the specific combinations of features described above, but also covers other technical solutions formed by any combination of the above features or their equivalents without departing from the concept of the disclosure, for example, solutions formed by substituting the above features with (but not limited to) technical features having similar functions disclosed in the present disclosure.
Moreover, although operations are depicted in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order. In certain circumstances, multitasking and parallel processing may be advantageous. Likewise, while several specific implementation details are included in the above discussion, these should not be construed as limiting the scope of the present disclosure. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are example forms of implementing the claims.

Claims (10)

1. A training method for an image generation model, characterized by comprising:
acquiring a training data pair, wherein the training data pair includes a training panoramic image sample and a description text corresponding to the training panoramic image sample;
inputting the training panoramic image sample and the description text into a pre-constructed image generation model to be trained, so as to perform position encoding on an input vector corresponding to the training panoramic image sample through a preset position rotation matrix formula, obtain a position rotation matrix corresponding to the input vector, and calculate an output vector based on the position rotation matrix corresponding to the input vector; and
adjusting model parameters of the image generation model to be trained based on a loss value between a training vector of the training panoramic image sample and the output vector, to obtain an image generation model.
2. The method according to claim 1, characterized in that performing position encoding on the input vector corresponding to the training panoramic image sample through the preset position rotation matrix formula to obtain the position rotation matrix corresponding to the input vector comprises:
acquiring the input vector corresponding to the training panoramic image sample;
acquiring each sub-vector corresponding to the input vector; and
encoding pixel position information corresponding to each sub-vector according to the position rotation matrix formula, obtaining a plurality of sub-position rotation matrices, and combining them to obtain the position rotation matrix corresponding to the input vector.
3. The method according to claim 2, characterized in that encoding the pixel position information corresponding to each sub-vector according to the position rotation matrix formula, obtaining the plurality of sub-position rotation matrices, and combining them to obtain the position rotation matrix corresponding to the input vector comprises:
determining target pixel position coordinates of each sub-vector in a target coordinate system based on the pixel position information corresponding to each sub-vector;
determining a first position rotation matrix parameter value based on a preset attitude parameter, the target pixel position coordinates of each sub-vector, and a preset first matrix parameter calculation formula;
determining a second position rotation matrix parameter value based on the target pixel position coordinates of each sub-vector and a preset second matrix parameter calculation formula; and
inputting the first position rotation matrix parameter value and the second position rotation matrix parameter value into the position rotation matrix formula to obtain a sub-position rotation matrix of each sub-vector, and combining the plurality of sub-position rotation matrices to obtain the position rotation matrix corresponding to the input vector.
4. The method according to claim 3, characterized in that determining the target pixel position coordinates of each sub-vector in the target coordinate system based on the pixel position information corresponding to each sub-vector comprises:
acquiring initial pixel position coordinates of each sub-vector in an initial coordinate system based on the pixel position information corresponding to each sub-vector; and
converting the initial pixel position coordinates of each sub-vector according to a preset coordinate system conversion formula to obtain the target pixel position coordinates of each sub-vector in the target coordinate system.
5. An image generation method, characterized by comprising:
acquiring an image generation description text;
inputting the image generation description text into an image generation model, so as to perform position encoding on a target vector corresponding to a preset noise vector through a preset position rotation matrix formula, obtain a position rotation matrix corresponding to the target vector, calculate a generated vector based on the position rotation matrix corresponding to the target vector, and decode the generated vector to obtain a target panoramic image;
wherein the image generation model is obtained according to the training method for an image generation model of any one of claims 1-4.
6. The method according to claim 5, characterized in that performing position encoding on the target vector corresponding to the preset noise vector through the preset position rotation matrix formula to obtain the position rotation matrix corresponding to the target vector comprises:
acquiring the target vector corresponding to the noise vector;
acquiring each sub-target vector corresponding to the target vector; and
encoding pixel position information corresponding to each sub-target vector according to the position rotation matrix formula, obtaining a plurality of sub-target position rotation matrices, and combining them to obtain the position rotation matrix corresponding to the target vector.
7. The method according to claim 6, characterized in that the method further comprises:
calculating based on the sub-target position rotation matrices corresponding to any two of the sub-target vectors and a preset angle calculation formula, to obtain a target angle;
determining pixel distance information between the any two sub-target vectors based on the target angle; and
determining a pixel position relationship of the target panoramic image based on the pixel distance information.
8. An electronic device, characterized by comprising:
a processor; and
a memory for storing instructions executable by the processor;
wherein the processor is configured to read the executable instructions from the memory and execute the instructions to implement the training method for an image generation model of any one of claims 1-4 or the image generation method of any one of claims 5-7.
9. A computer-readable storage medium, characterized in that the storage medium stores a computer program for performing the training method for an image generation model of any one of claims 1-4 or the image generation method of any one of claims 5-7.
10. A computer program product, characterized by comprising a computer program which, when executed by a processor, implements the training method for an image generation model of any one of claims 1-4 or the image generation method of any one of claims 5-7.
Priority Applications (1)

Application Number: CN202510294423.7A; Priority Date: 2025-03-12; Filing Date: 2025-03-12; Title: Image generation model training, image generation method, device, equipment and medium; Status: Pending

Publications (1)

Publication Number: CN120259461A; Publication Date: 2025-07-04

Family ID: 96191359


Legal Events

PB01: Publication
SE01: Entry into force of request for substantive examination
