Movatterモバイル変換


[0]ホーム

URL:


CN120144595A - A method and system for constructing a digital twin model of a mobile terminal - Google Patents

A method and system for constructing a digital twin model of a mobile terminal
Download PDF

Info

Publication number
CN120144595A
CN120144595ACN202510632025.1ACN202510632025ACN120144595ACN 120144595 ACN120144595 ACN 120144595ACN 202510632025 ACN202510632025 ACN 202510632025ACN 120144595 ACN120144595 ACN 120144595A
Authority
CN
China
Prior art keywords
data
mobile terminal
model
retrieval
source heterogeneous
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202510632025.1A
Other languages
Chinese (zh)
Other versions
CN120144595B (en
Inventor
张俊曦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian Shuchen Technology Co ltd
Original Assignee
Dalian Shuchen Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian Shuchen Technology Co ltdfiledCriticalDalian Shuchen Technology Co ltd
Priority to CN202510632025.1ApriorityCriticalpatent/CN120144595B/en
Publication of CN120144595ApublicationCriticalpatent/CN120144595A/en
Application grantedgrantedCritical
Publication of CN120144595BpublicationCriticalpatent/CN120144595B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

Translated fromChinese

本发明公开了一种移动终端数字孪生模型构建方法及系统,包括获取非关系型的多源异构移动终端数据;构建三维内存映射关系模型以获取移动终端数据库;为移动终端数据库提供数据检索服务,实现移动终端数据库中向量化数据的数据检索;根据数据检索的向量化数据构建虚拟实体映射空间;定义应用智能体,根据虚拟实体映射空间获取用于实现用户环境实时感知处理执行RPA的移动终端数字孪生模型。解决了目前信息技术不能有效实现非关系型的多源异构移动终端数据碎片信息的采集整合,无法对多源异构移动终端数据进行结构化存储的问题,同时不能为用户在管理自己的数据安全、维护数据权益、加工使用数据以及了解数据状况等方面提供技术支持的问题。

The present invention discloses a method and system for constructing a digital twin model of a mobile terminal, including obtaining non-relational multi-source heterogeneous mobile terminal data; constructing a three-dimensional memory mapping relationship model to obtain a mobile terminal database; providing data retrieval services for the mobile terminal database to realize data retrieval of vectorized data in the mobile terminal database; constructing a virtual entity mapping space based on the vectorized data retrieved from the data; defining an application agent, and obtaining a mobile terminal digital twin model for realizing real-time perception and processing of user environment and executing RPA based on the virtual entity mapping space. The method solves the problem that current information technology cannot effectively realize the collection and integration of non-relational multi-source heterogeneous mobile terminal data fragment information, cannot perform structured storage of multi-source heterogeneous mobile terminal data, and cannot provide technical support for users in managing their own data security, maintaining data rights, processing and using data, and understanding data status.

Description

Mobile terminal digital twin model construction method and system
Technical Field
The invention relates to the technical field of digital twin, in particular to a method and a system for constructing a digital twin model of a mobile terminal.
Background
With the continuous progress of big data and artificial intelligence technology, the core value of data in the tide of new generation information technology is more important, and the construction of reliable data ecology and the maximization of the value of data elements have become key guidelines for promoting the development of future data. In this process, not only can large enterprises be able to produce intuitive and high-value data, but individual users also generate a large amount of scattered data resources in daily activities, but the potential value of the data is often ignored. These data can map widely and deeply various aspects of the user's life. From daily consumption habits, social interactions to work and study situations. Not only basic information of the user, such as age, gender, geographical position and the like, but also richer details, such as interests, emotional states, psychological activities, health conditions of the user, behavior patterns and decision-making processes of the user in different scenes and the like, are contained. The data generated by users in the daily life is an important window for understanding their lifestyle, demand preferences and potential behavioral trends.
The service mode of the current information technology mainly depends on vendor providing, and a large amount of data generated and related by individual users in daily use, including personal information, use records, preference settings and the like, are almost stored and saved on servers or data platforms of the vendors, and moreover, the data of all large application service vendors cannot be interconnected and communicated, complete data export service cannot be provided for the users, and the data use cannot be effectively converted into actual rights of the users, so that the users face great difficulties and challenges in managing own data security, maintaining data rights, processing use data, knowing data conditions and the like. Because there is no powerful way to integrate these fragment information acquisitions at present, the personal life data value cannot be completely and effectively released. How to collect personal data effectively and convert it into more valuable information for use and circulation has become a problem to be solved.
The data storage and the use need to be completed in a local mobile terminal in order to ensure the safety of the user data while releasing the value of the user data. The prior art scheme has the following problems that firstly, data types are changeable and wide in source, multi-source heterogeneous data are difficult to effectively expand and utilize in a traditional mode of single-table storage or JSON format storage, secondly, when a large-model RAG technology is applied, a mobile terminal cannot operate due to huge word embedding models, so that a TF-IDF or BM25 algorithm is mostly adopted, semantic loss is caused, data utilization effect is affected, thirdly, an artificial intelligence technology is utilized to ensure user data safety, and meanwhile, feedback of value is brought to users, complexity of using AIGC, LLM, AGENT and other technologies by users is reduced, and a complete personal digital twin space is constructed.
Disclosure of Invention
The invention provides a mobile terminal digital twin model construction method and system for overcoming the technical problems.
In order to achieve the above object, the technical scheme of the present invention is as follows:
a mobile terminal digital twin model construction method comprises the following steps:
S1, acquiring and acquiring non-relational multi-source heterogeneous mobile terminal data;
The multi-source heterogeneous mobile terminal data comprises mobile terminal application data, hardware sensor data and user private data;
S2, constructing a three-dimensional memory mapping relation model;
Data storage is carried out on the multi-source heterogeneous mobile terminal data based on the three-dimensional memory mapping relation model, a three-dimensional memory mapping relation table for storing non-relation data is obtained, field names of the multi-source heterogeneous mobile terminal data are redefined into field indexes corresponding to data fields of the non-relation mobile terminal through the three-dimensional memory mapping table;
carrying out vectorization processing on the multi-source heterogeneous mobile terminal data acquired through the field index, and acquiring vectorized data to store and acquire a mobile terminal database;
S3, providing data retrieval service for the mobile terminal database to realize data retrieval operation of vectorized data in the mobile terminal database;
s4, constructing a virtual entity mapping space according to the vectorized data obtained by the data retrieval operation;
And defining an application agent applied to the virtual entity mapping space, and acquiring a mobile terminal digital twin model for realizing the execution of RPA (remote procedure association) by the real-time sensing processing of the user environment according to the virtual entity mapping space.
Further, the step S2 specifically includes the following steps:
S21, constructing a three-dimensional memory mapping relation model, which comprises a data table setting module, a field set acquisition module, a data type assignment module and a three-dimensional memory mapping relation module;
the data table setting module is used for setting a data table for data storage of the multi-source heterogeneous mobile terminal data according to the set SQLite database;
The field set acquisition module is used for adding data fields to the multi-source heterogeneous mobile terminal data stored in the data table to acquire a data field set F;
And f= { data_0, data_1,..data_n }, wherein data_n represents the n-th data field added;
the data type designating module is used for setting a type label set of each data field type in the data field set;
The three-dimensional memory mapping relation module is used for constructing a three-dimensional memory mapping relation according to the data field set and the type tag set, and acquiring a three-dimensional memory mapping table through the three-dimensional memory mapping relation;
The three-dimensional memory mapping relation is M, F×T type×N- & gt V, wherein M represents a shorthand form of M (F, T, N), T type represents a type tag set, N represents a unique identifier of a data record, V represents a data value containing all multi-source heterogeneous mobile terminal data to be stored, F represents a data field and F epsilon F, T represents a data type and T epsilon T type, and N represents a value corresponding to the record identifier and N epsilon N;
s22, redefining field names of the multi-source heterogeneous mobile terminal data into field indexes corresponding to the data fields of the non-relational mobile terminal through the acquired three-dimensional memory mapping table;
S23, based on the pre-trained continuous word bag model CBOW, carrying out vectorization processing on the multi-source heterogeneous mobile terminal data acquired through the field index, and acquiring vectorized data to store and acquire a mobile terminal database.
Further, the data retrieval service in S3 includes a data retrieval service based on semantic retrieval and a data retrieval service based on SQL retrieval;
the data retrieval service retrieval method based on semantic retrieval comprises the following steps:
constructing a plurality of data vector index navigation layer nodes for data retrieval through HNSW graph algorithm based on a mobile terminal database;
Acquiring edge weights of the data vector index navigation layer nodes based on a heuristic search method;
according to the edge weight and the data vector index navigation layer node, acquiring an index navigation chart structure related to data retrieval;
An embedded Vector search engine J Vector based on single instruction multiple data stream SIMD realizes the data index of a mobile terminal database according to an index navigation chart structure;
The searching method of the data searching service based on SQL searching comprises the following steps:
semantic analysis is carried out on the data of the mobile terminal database through a large language model LLM, and a structured query intention representation is obtained;
based on a pre-trained Text2SQL model, acquiring an SQL sentence according to the structured query intention representation;
And acquiring the SQL query statement based on the SQL statement according to the storage format of the local database and the field index characteristic of the vectorized data through the local data storage mode technology so as to realize the data index of the mobile terminal database according to the SQL query statement.
Further, the step S4 specifically includes the following steps:
S41, customizing an entity model and an entity event of any mobile terminal;
the method comprises the steps of constructing a promt text for acquiring a structural semantic description framework in a self-defined mode according to a physical model and a physical event of a mobile terminal through a natural language technology;
s42, invoking AIGC models of the meristematic diagram/meristematic video, and acquiring virtual digital entity model characterization according to the Prompt text;
S43, converting the Prompt text into a vector to be queried, and retrieving and acquiring multi-source heterogeneous mobile terminal data related to virtual digital entity model representation through the vector to be queried based on a data retrieval service provided by a mobile terminal database;
the method comprises the steps of combining a pre-trained large language model LLM, and acquiring an overview report of an entity event according to the retrieved and acquired multi-source heterogeneous mobile terminal data, wherein the overview report of the entity event is a virtual entity mapping space where a user is located;
S44, defining an application agent applied to the virtual entity mapping space, and determining an execution action instruction of a certain scene in the current user environment based on the overview report of the entity event retrieved and acquired by the virtual entity mapping space;
S45, executing the actions of the mobile terminal according to the executing action instructions so as to realize the function of executing RPA by the user environment real-time sensing processing, and further realize the construction of a digital twin model of the mobile terminal;
And the execution action instruction at least comprises an operation instruction for the interface of the user mobile terminal and an access instruction for the interface of the remote server terminal.
Further, the method for acquiring and acquiring the application data of the mobile terminal in S1 comprises an active acquisition method and a passive acquisition method of the application data based on barrier-free service;
the application data active acquisition method comprises the following steps:
S001, starting authority of barrier-free service of the android system:
enabling the permission to execute gesture operation permission setting for performing simulated click operation on the mobile terminal;
Enabling the permission setting of traversing window content for acquiring interface data;
S002, opening a suspension window for collecting application data of the mobile terminal;
S003, a background server is called to download an application data acquisition plug-in so as to acquire and acquire structured data of application data of the mobile terminal according to the position of the floating window based on simulated click operation;
the application data passive acquisition method comprises the following steps:
s100, starting authority of barrier-free service of an android system:
enabling the permission to execute gesture operation permission setting for performing simulated click operation on the mobile terminal;
Enabling the permission setting of traversing window content for acquiring interface data;
S101, monitoring the change of a mobile terminal page in real time by calling an obstacle-free service monitoring event tool, and when the page is changed, acquiring the structured data displayed on the mobile terminal page in real time by calling an interface active window node acquisition tool, wherein the acquisition action is not executed;
The method for calling the barrier-free service monitoring event tool to monitor the page change of the mobile terminal in real time comprises the following steps:
s1011, acquiring a clicking time when the mobile terminal receives an interface clicking event, and taking the clicking time as a first clicking time;
Acquiring a second clicking moment when the mobile terminal receives the interface clicking event, wherein the second clicking moment is the clicking moment corresponding to the interface clicking event after the interface clicking event corresponding to the first clicking moment;
s1012, acquiring a click time interval between the second click time and the first click time;
judging the size between the clicking time interval and a preset time interval threshold;
And if the clicking time interval is smaller than or equal to the preset time interval threshold, confirming that the page of the mobile terminal is unchanged.
Further, the S1 further includes a method for performing data compression on the multi-source heterogeneous mobile terminal data:
coding information source symbols in the multi-source heterogeneous mobile terminal data based on Huffman coding to obtain information source coding symbols;
Acquiring the occurrence probability of each source coding symbol, and carrying out data compression on multi-source heterogeneous mobile terminal data according to the source coding symbols through a probability statistical model based on the source;
and the expression of the probability statistical model based on the information source is,
Wherein: Representing a desired length of data compression; Representing the average code length; E represents the expected; representing the coding of symbols for all possible sourcesThe Xie represents the set of source code symbols; representing source coded symbolsProbability of occurrence; representing source coded symbolsCorresponding code length.
The mobile terminal digital twin model construction system comprises a data acquisition module, a data compression module, a three-dimensional memory mapping relation model construction module, a data vectorization module, a data retrieval service module, an application agent setting module and a mobile terminal digital twin model construction module;
the data acquisition module is used for acquiring and acquiring non-relational multi-source heterogeneous mobile terminal data;
The multi-source heterogeneous mobile terminal data comprises mobile terminal application data, hardware sensor data and user data;
the data compression module is used for carrying out data compression on the multi-source heterogeneous mobile terminal data to obtain compressed data;
The three-dimensional memory mapping relation model construction module is used for constructing a three-dimensional memory mapping relation model;
Data storage is carried out on the multi-source heterogeneous mobile terminal data based on the three-dimensional memory mapping relation model, a three-dimensional memory mapping relation table for storing non-relation data is obtained, field names of the multi-source heterogeneous mobile terminal data are redefined into field indexes corresponding to data fields of the non-relation mobile terminal through the three-dimensional memory mapping table;
The data vectorization module is used for vectorizing the multi-source heterogeneous mobile terminal data acquired through the field indexes to acquire vectorized data for storing and acquiring a mobile terminal database;
The data retrieval service module is used for providing data retrieval service for the mobile terminal database so as to realize the data retrieval operation of the vectorized data in the mobile terminal database;
And the data retrieval service comprises a data retrieval service based on semantic retrieval and a data retrieval service based on SQL retrieval;
The application agent setting module is used for setting application agents applied to the virtual entity mapping space, and the application agents at least comprise an information security agent and a health management agent;
The mobile terminal digital twin model construction module is used for constructing a virtual entity mapping space according to the vectorized data acquired by the data retrieval operation, and acquiring a mobile terminal digital twin model for realizing the user environment real-time perception processing execution RPA according to the virtual entity mapping space based on the application agent.
The method and the system have the advantages that the method and the system for constructing the digital twin model of the mobile terminal effectively realize the acquisition and integration of the data fragment information of the non-relational multi-source heterogeneous mobile terminal by carrying out data storage on the multi-source heterogeneous mobile terminal data based on the constructed three-dimensional memory mapping relation model, acquire the three-dimensional memory mapping relation table for storing the non-relational data, greatly improve the comprehensiveness of acquiring the digits of the user mobile terminal, redefine the field names of the multi-source heterogeneous mobile terminal data into field indexes corresponding to the data fields of the non-relational mobile terminal through the three-dimensional memory mapping table, carry out vectorization processing on the multi-source heterogeneous mobile terminal data acquired through the field indexes, acquire vectorized data to store and acquire a mobile terminal database, effectively realize the structured storage of the multi-source heterogeneous mobile terminal data, conveniently provide complete data export service for users, lay a foundation by defining an application intelligent body applied to a virtual entity mapping space, acquire a mobile terminal digital twin model for realizing the implementation of real-time sensing and RPA (RPA) on the basis of realizing the real-time sensing processing of a user environment according to the constructed virtual entity mapping space, and provide great knowledge of the user on the aspects of managing and maintenance of own data, use data, service data and application and data processing conditions.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions of the prior art, the drawings that are needed in the embodiments or the description of the prior art will be briefly described below, it being obvious that the drawings in the following description are some embodiments of the invention and that other drawings can be obtained from them without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a method for constructing a digital twin model of a mobile terminal according to the present invention;
FIG. 2 is a system block diagram of a mobile terminal digital twin model building system in the present embodiment;
fig. 3 is a schematic diagram of non-relational multi-source heterogeneous mobile terminal data in the present embodiment.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments of the present invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The embodiment provides a mobile terminal digital twin model construction method, as shown in fig. 1, comprising the following steps:
S1, acquiring and acquiring non-relational multi-source heterogeneous mobile terminal data;
The multi-source heterogeneous mobile terminal data comprises mobile terminal application data, hardware sensor data and user private data;
The intelligent mobile phone is characterized in that the intelligent mobile phone is used as a personal daily data acquisition terminal as the function of the intelligent mobile phone is continuously enhanced, meanwhile, the acquired all user data is required to be locally stored on the mobile terminal to be completely covered by combining with the prior art, and the personal data, namely the mobile terminal data, is divided into three main types, namely application data, data generated by the user using application services provided by various application manufacturers, sensor data based on data acquired by sensors and communication modules inside and outside the intelligent terminal, and data of ideas, records, behaviors and the like generated by the user at any time;
The application data is distributed among various large application manufacturers, the data export function is not provided, and the application data cannot be acquired in a traditional data web crawler mode due to different technical frameworks used by various system applications, so that acquiring and integrating application related data becomes extremely difficult;
The embodiment collects application data on the mobile intelligent terminal device in a manner that the user sees the application data, the collection manner of the application data comprises an application data active collection method and an application data passive collection method based on barrier-free service (Accessibility Service), the active collection is personal data collection of third party application, and the application data active collection method specifically comprises the following steps:
S001, starting the authority of barrier-free service of an Android (Android) system:
Comprises enabling the permission (can Perform Gestures) for executing gesture operation to set up the simulated click operation for the mobile terminal;
Enabling a traversal-allowed Window Content authority (CAN RETRIEVE Window Content) setting for acquiring interface data;
S002, starting a floating window for collecting application data of the mobile terminal, namely a movable window;
S003, a background server is called to download an application data acquisition plug-in so as to acquire and acquire structured data of application data of the mobile terminal according to the position of the floating window based on simulated click operation;
The method specifically comprises the following steps:
S0031, starting a third party APP according to a set third party APP package name;
S0032, acquiring a root node of an accessible tree in a current active window through an accessible service interface active window node acquisition tool (get Root In Active Window) so as to acquire the similarity between the current active window node tree and an expected setting interface node tree of a final data interface of a third-party APP, and further determining whether the current active window interface belongs to the expected setting interface of the final data interface of the third-party APP;
The method for obtaining the similarity between the current active window node tree and the expected setting interface node tree of the final data interface of the third party APP is a prior known technical means and is not repeated herein;
s0033, if the current active window interface belongs to the expected setting interface of the final data interface of the third party APP of the mobile terminal, the next-stage functional page of the current active window is the final data interface;
If the current active window interface does not belong to the expected setting interface of the final data interface of the third party APP of the mobile terminal, entering a next-stage functional page of the current active window, and repeatedly executing S132 until the current active window interface belongs to the expected setting interface of the final data interface of the third party APP of the mobile terminal, and acquiring a final data interface;
S0034, if the data of the final data interface are displayed in a list or loaded in real time through a network, executing S0035;
If the final data interface is presented in a picture form, executing S0036;
S0035, performing downward sliding page turning on a final data interface until no new data is added or acquisition requirements are met, traversing all node elements of the final data interface according to a method of an interface active window node acquisition tool (get Root In Active Window) of barrier-free service, and acquiring large character strings of all text data of the final data interface;
S0036, storing the pictures to a local or remote server, and splicing the picture paths to form large character strings of all the picture paths of the final data interface;
s0037, obtaining final data interface structured data of the third party APP of the mobile terminal based on the regular expression according to the large character string.
In this embodiment, the application data is passively collected, that is, some information seen by the user when using the APP, including data received by talking chat, browsing news, online shopping, brushing a circle of friends, and the like, and the latest received knowledge information of the user can be effectively known by collecting the data that the user views in real time, so as to identify the recent state of the user, and the like. The traditional mode used for collecting third party application data is OCR (optical character recognition) after screen capturing, but because of the performance problem of a mobile phone, the information seen by a user cannot be completely collected in real time, so the application provides an application data passive collection method based on barrier-free service (Accessibility Service), and the application data passive collection method specifically comprises the following steps:
S100, starting the authority of barrier-free service of an Android system, wherein the mobile terminal is a mobile phone of the Android system;
Comprises enabling the permission (can Perform Gestures) for executing gesture operation to set up the simulated click operation for the mobile terminal;
Enabling a traversal-allowed Window Content authority (CAN RETRIEVE Window Content) setting for acquiring interface data;
S101, monitoring the change of a mobile terminal page in real time by calling an obstacle-free service monitoring event tool (on Accessibility Event), and when the page is changed, acquiring the structured data displayed on the mobile terminal page in real time by calling an interface active window node acquisition tool (get Root In Active Window);
the method for calling the barrier-free service monitoring event tool (on Accessibility Event) to monitor the page change of the mobile terminal in real time comprises the following steps:
s1011, acquiring a clicking time when the mobile terminal receives an interface clicking event, and taking the clicking time as a first clicking time;
Acquiring a second clicking moment when the mobile terminal receives the interface clicking event, wherein the second clicking moment is the clicking moment corresponding to the interface clicking event after the interface clicking event corresponding to the first clicking moment;
s1012, acquiring a click time interval between the second click time and the first click time;
judging the size between the clicking time interval and a preset time interval threshold;
If the clicking time interval is smaller than or equal to the preset time interval threshold, confirming that the page of the mobile terminal is unchanged;
The hardware sensor data can well detect and identify the physical environment where the user is located, the NFC tag can finely transmit the real-time scene of the user to the set digital twin space, the physical space position where the user is located can be judged through WIFI and Bluetooth connection data, the data processing algorithm can identify the user-defined scene of the data such as GPS, gyroscope, gravity sensing and light sensing, and the state acquisition of the physical world where the user is located can be enriched by combining an external intelligent hardware sensor, so that intelligent help more fitting the physical state of the user can be made in the digital twin space.
The user private data is provided, namely, all private data in life can be recorded by a user through text, voice, photo and video, the user can be helped to digitize the previously unconscious information by collecting the data through a convenient and quick information recording mode, a digital twin space can be constructed to visualize the data by using an artificial intelligent large model technology, and solid and powerful basic data is provided for personal digital twin;
In a specific embodiment, the method for performing data compression on the multi-source heterogeneous mobile terminal data is further included in S1:
coding information source symbols in the multi-source heterogeneous mobile terminal data based on Huffman coding to obtain information source coding symbols;
Acquiring the occurrence probability of each source coding symbol, and carrying out data compression on multi-source heterogeneous mobile terminal data according to the source coding symbols through a probability statistical model based on the source;
The problem that the local Data processing speed cannot keep pace with the acquisition speed happens in the actual Data acquisition process is solved, vector Data retrieval is carried out on the user attention dimensional Data, namely, the acquired mobile terminal Data are processed by carrying out Data compression on the application scene corresponding to the user attention program, namely, the user setting through the back end recognition matching, according to the priority corresponding to the Data type, the Data are transmitted to a large model at fixed time, and the fact that the same information Source symbol content characteristic frequently occurs in the daily Data of the user is considered in the embodiment, wherein the Data information Source (Data Source) of the daily Data of the user is a core concept in the fields of Data science, information technology and communication, and the Data Source refers to the original Source of the Data or an entity/system for providing the Data. The method can be any medium, equipment, platform or environment for generating, storing or transmitting data, and the quality, reliability and type of a data information source directly influence the application value and analysis result of the data;
therefore, the embodiment adopts Huffman coding and defines probability according to the occurrence frequency of the signal source symbol in the data so as to realize the aim of data compression according to a signal source-based probability statistical model;
And the expression of the probability statistical model based on the information source is,
Wherein: Representing a desired length of data compression; Representing the average code length; E represents the expected; representing the coding of symbols for all possible sourcesThe Xie represents the set of source code symbols; representing source coded symbolsProbability of occurrence; Representing source symbolsThe source code symbol with large probability codes short and the source code symbol with small probability codes long;
S2, constructing a three-dimensional memory mapping relation model;
Data storage is carried out on the multi-source heterogeneous mobile terminal data based on the three-dimensional memory mapping relation model, a three-dimensional memory mapping relation table for storing non-relation data is obtained, field names of the multi-source heterogeneous mobile terminal data are redefined into field indexes corresponding to data fields of the non-relation mobile terminal through the three-dimensional memory mapping table;
carrying out vectorization processing on the multi-source heterogeneous mobile terminal data acquired through the field index, and acquiring vectorized data to store and acquire a mobile terminal database;
The application of the standard digital twin technique in this embodiment mainly focuses on the accurate one-to-one replication modeling of target objects in the physical world using high-precision three-dimensional models. Extensive data acquisition work is involved, i.e., data is densely collected from each key location point of a physical target entity to ensure that a physical map can be constructed in virtual digital space that is highly consistent with and complete to the physical object. The complexity and diversity of the physical world is such that model data exhibits multi-dimensional characteristics. In particular, for a complex-structure, functionally diverse, data-driven entity, modeling by considering only its basic morphology is far from sufficient. The requirements of users for digital twin applications also often have a high degree of individuation, and models in the virtual space need to be flexibly customized according to the actual requirements of the users. In view of this, the embodiment is based on a comprehensive data acquisition technology, and performs data processing and integration on various generalized data of a user in daily use by constructing a three-dimensional memory mapping relationship model to obtain comprehensive information about user preferences, behavior habits and potential requirements;
the acquired data has very wide sources including active and passive, real-time and historical, full and incremental dimensions, various data types, including structured, unstructured and semi-structured data, and the data specifically comprises various forms such as text, voice, pictures and video, so that in order to better construct a digital twin space model based on the data, an effective strategy needs to be adopted to preprocess the data, the data is conventionally structured according to a standard and then put into storage, but the acquired data sources are complicated, so that the integrity and expansibility of some data are lost, the user related private data cannot participate in the construction of a model due to dimensional uncertainty, and meanwhile, the original data are required to be stored, the multi-source heterogeneous data are required to be formatted and stored, the existing scheme can realize dynamic increase and decrease of fields and data tables through the non-relational data, but the selection of the non-relational data base is relatively limited under the condition that the calculation performance of a mobile terminal is weak, and the situation that the content of the fields can be understood is difficult to understand when the data is used, so that the three-dimensional heterogeneous data is mapped to realize the construction of the three-dimensional heterogeneous data through the mobile terminal;
in a specific embodiment, the step S2 specifically includes the following steps:
S21, constructing a three-dimensional memory mapping relation model, which comprises a data table setting module, a field set acquisition module, a data type assignment module and a three-dimensional memory mapping relation module;
the data table setting module is used for setting a data table for data storage of the multi-source heterogeneous mobile terminal data according to the set SQLite database;
The field set acquisition module is used for adding data fields to the multi-source heterogeneous mobile terminal data stored in the data table to acquire a data field set F;
And f= { data_0, data_1,..data_n }, wherein data_n represents the n-th data field added;
the data type designating module is used for setting a type label set of each data field type in the data field set;
The three-dimensional memory mapping relation module is used for constructing a three-dimensional memory mapping relation according to the data field set and the type tag set, and acquiring a three-dimensional memory mapping table through the three-dimensional memory mapping relation;
The three-dimensional memory mapping relation is M, F and T type is N and V, wherein M represents a shorthand form of M (F, T, N), T type represents a type tag set, N represents a unique identifier of a data record, V represents a data value containing all multi-source heterogeneous mobile terminal data to be stored, F represents a data field, F is E F, T represents a data type, T is E T type, N represents a value corresponding to the record identifier, N is E N, the model can store data from different sources due to the flexibility of F and T type, the data can be heterogeneous and can support standard SQL query data, the embodiment can easily query, update and maintain the relation among the data type fields through M mapping, pre-processing is carried out when the standard SQL data query is used, and the field names are redefined as field indexes corresponding to the fields through a three-dimensional memory mapping model, so that the potential value of the data can be fully exerted in the use process of the subsequent data;
s22, redefining field names of the multi-source heterogeneous mobile terminal data into field indexes corresponding to the data fields of the non-relational mobile terminal through the acquired three-dimensional memory mapping table;
S23, based on a pre-trained continuous word bag model CBOW, carrying out vectorization processing on multi-source heterogeneous mobile terminal data acquired through field indexes, and acquiring vectorized data to store and acquire a mobile terminal database;
In particular, in the embodiment, text data acquired by a mobile terminal is converted into vectors and original data mapping storage by combining a Word2vec with a self-defined dictionary and a Word vector mode, other unstructured or semi-structured data slices are converted into vectors and original data mapping storage by using TIKA tools, when the Word2vec is used, the problem of insufficient computing performance of the mobile terminal is considered, a Word2vec technology improved by Embedding is introduced to vectorize CBOW (Continuous Bag-of-Words Model) Continuous Word Bag Model pre-trained by using related data, specific lines are selected from a weight matrix directly to be transmitted to the next layer, lengthy matrix multiplication operation is avoided, and the Embedding layer is used for extracting Word ID corresponding lines (vectors) from the weight parameters, so that the computing amount is reduced, and meanwhile, the usability and practicability of data in subsequent modeling are greatly improved by combining with the common dictionary in the life and working of users;
According to the embodiment, non-relational data is stored by using a relational database SQLite which is more suitable for being embedded in a mobile terminal, a data table is created, data fields data_0 to data_63 are added, the data types of 64 fields are supported at most, the data types are specified by adding data_type fields, a three-dimensional memory mapping relation model is built to index original fields, multi-source heterogeneous data which is similar to the characteristics of a columnar database can be better stored, the relation among various data type fields is better maintained, and the method is more suitable for an OLAP (on-line analysis processing) scene, the expansibility of the data types and the fields is enhanced, and in addition, formatted storage can be realized by customizing the data format of any data type through a preset data system or a user;
S3, providing data retrieval service for the mobile terminal database to realize data retrieval operation of vectorized data in the mobile terminal database;
in a specific embodiment, the data retrieval service provided for the mobile terminal database comprises a data retrieval service based on semantic retrieval and a data retrieval service based on SQL retrieval;
Specifically, the traditional keyword matching algorithm only considers the similarity in terms of vocabulary, but ignores the relation between sentence structure and semantics, the semantic search algorithm can better understand the query intention of the user, and provides more accurate search results, and in order to accelerate the Vector search speed, the embodiment performs local search on related processed mobile terminal Data by using an embedded Vector search engine J Vector accelerated by a graph algorithm and SIMD (Single Instruction, multiple Data);
the data retrieval service retrieval method based on semantic retrieval comprises the following steps:
constructing a plurality of data vector index navigation layer nodes for data retrieval through HNSW graph algorithm based on a mobile terminal database;
Acquiring edge weights of the data vector index navigation layer nodes based on a heuristic search method;
according to the edge weight and the data vector index navigation layer node, acquiring an index navigation chart structure related to data retrieval;
An embedded Vector search engine J Vector based on single instruction multiple data stream SIMD realizes the data index of a mobile terminal database according to an index navigation chart structure;
specifically, the embodiment can automatically analyze the user problem through a large language model and Text2SQL and combining a local optimized data storage mode technology, and convert the user problem into a corresponding SQL query statement, so that the flow of database query is simplified, the accuracy and efficiency of the query are improved, the threshold of database operation is reduced, more users can conveniently acquire and utilize structured data, and complex database query work can be completed without professional SQL knowledge;
The searching method of the data searching service based on SQL searching comprises the following steps:
semantic analysis is carried out on the data of the mobile terminal database through a large language model LLM, and a structured query intention representation is obtained;
based on a pre-trained Text2SQL model, acquiring an SQL sentence according to the structured query intention representation;
Acquiring SQL query sentences based on the SQL query sentences according to the storage format of a local database and field index features of vectorized data through a local data storage mode technology so as to realize data index of the mobile terminal database according to the SQL query sentences;
s4, constructing a virtual entity mapping space according to the vectorized data obtained by the data retrieval operation;
Defining an application agent applied to a virtual entity mapping space, and acquiring a mobile terminal digital twin model for realizing the execution of RPA (remote procedure association) by the real-time sensing processing of a user environment according to the virtual entity mapping space;
In the process of constructing a virtual model of a physical entity applied to a digital twin space, the conventional method generally uses Unity 3D/Unreal Engine to perform pre-modeling, and acquires relevant data of the physical entity in real time to realize comprehensive synchronous mapping with the virtual entity. However, when modeling a digital twin space of a related entity of daily work and life of a user under a mobile terminal, the user can only select a system to perform digital twin construction on a physical entity which is modeled in advance, and can only display information of a preset foundation of the system, and the digital twin technology can not be applied to various aspects of life of the user to adapt to personalized requirements of the user;
The method specifically comprises the following steps:
S41, customizing an entity model and an entity event of any mobile terminal;
the method comprises the steps of constructing a promt text for acquiring a structural semantic description framework in a self-defined mode according to a physical model and a physical event of a mobile terminal through a natural language technology;
s42, invoking AIGC models of the meristematic diagram/meristematic video, and acquiring virtual digital entity model characterization according to the Prompt text;
S43, converting the Prompt text into a vector to be queried, and retrieving and acquiring multi-source heterogeneous mobile terminal data related to virtual digital entity model representation through the vector to be queried based on a data retrieval service provided by a mobile terminal database;
the method comprises the steps of combining a pre-trained large language model LLM, and acquiring an overview report of an entity event according to the retrieved and acquired multi-source heterogeneous mobile terminal data, wherein the overview report of the entity event is a virtual entity mapping space where a user is located;
s44, defining application agents applied to the virtual entity mapping space, such as an information security agent and a health management agent, and determining an execution action instruction of a certain scene in the current user environment based on the overview report of the entity event retrieved and acquired by the virtual entity mapping space;
S45, executing the action of the mobile terminal according to the executing action instruction so as to realize the function of executing RPA by the user environment real-time sensing processing, and further realize the construction of a digital twin model of the mobile terminal, wherein the executing action instruction at least comprises an operation instruction for a user mobile terminal interface and an access instruction for a remote server interface. According to the embodiment, through Accessibility Service barrier-free services, operations such as clicking, sliding and inputting of an interface can be executed to realize an RPA function, action recording is provided for a user to realize more convenient operation, a transparent suspension window is started to record a click coordinate path of the user, recording of the user action is performed, and linkage can be realized by associating the action with an application scene, for example, a digital twin space recognizes that a scene is connected to home and automatically connected with WIFI, the current environment temperature is not suitable, the action is automatically found out from Mijia application and then an air conditioner is opened, the problem of traditional rule-based scene automation is solved through large model scene recognition, and the linkage limitation among various applications is broken;
According to the embodiment, AIGC technology is adopted, a user is allowed to customize a physical model or object dimension of a mobile terminal, a Prompt is built through user natural language customization, a virtual digital entity is built through a text-generated graph/text-generated video, meanwhile, all relevant original data of the description entity are obtained through semantic retrieval of collected vectorized relevant data through the custom Prompt natural language, RAG (RETRIEVAL-augmented Generation) complete functions of local data of the mobile terminal are realized by combining a large language model LLM, the retrieved and matched data are processed through the large model to obtain an overview report of the digital entity, in addition, the user can redefine the Prompt, further process and return dimension information concerned by the user through the large model, and a placeholder is provided for real-time dimension information to be embedded into a large model answer result, so that high-rate information synchronization can be kept, and meanwhile, the user can further see and export detailed structured data related to the entity model through the provided data retrieval function;
The embodiment also comprises a mobile terminal digital twin model construction system, as shown in fig. 2, which comprises a data acquisition module, a data compression module, a three-dimensional memory mapping relation model construction module, a data vectorization module, a data retrieval service module, an application agent setting module and a mobile terminal digital twin model construction module;
the data acquisition module is used for acquiring and acquiring non-relational multi-source heterogeneous mobile terminal data;
The multi-source heterogeneous mobile terminal data comprises mobile terminal application data, hardware sensor data and user data;
the data compression module is used for carrying out data compression on the multi-source heterogeneous mobile terminal data to obtain compressed data;
The three-dimensional memory mapping relation model construction module is used for constructing a three-dimensional memory mapping relation model;
Data storage is carried out on the multi-source heterogeneous mobile terminal data based on the three-dimensional memory mapping relation model, a three-dimensional memory mapping relation table for storing non-relation data is obtained, field names of the multi-source heterogeneous mobile terminal data are redefined into field indexes corresponding to data fields of the non-relation mobile terminal through the three-dimensional memory mapping table;
The data vectorization module is used for vectorizing the multi-source heterogeneous mobile terminal data acquired through the field indexes to acquire vectorized data for storing and acquiring a mobile terminal database;
The data retrieval service module is used for providing data retrieval service for the mobile terminal database so as to realize the data retrieval operation of the vectorized data in the mobile terminal database;
And the data retrieval service comprises a data retrieval service based on semantic retrieval and a data retrieval service based on SQL retrieval;
The application agent setting module is used for setting application agents applied to the virtual entity mapping space, and the application agents at least comprise an information security agent and a health management agent;
The mobile terminal digital twin model construction module is used for constructing a virtual entity mapping space according to the vectorized data acquired by the data retrieval operation, and acquiring a mobile terminal digital twin model for realizing the user environment real-time perception processing execution RPA according to the virtual entity mapping space based on the application agent.
The model application of the mobile terminal digital twin model constructed in the embodiment is that through collected various data in daily life of a personal user, the user can customize complete digital twin virtual mapping of various entities in life, has detailed state checking and control on the entities, simultaneously provides application of the user to the whole twin space data, and grasps rich language knowledge and modes through pre-training on large-scale text data based on a large language model. The models can understand complex language structures, generate coherent and strong-logic texts, further enable a large language model to analyze queries or prompts input by users, understand the intention and the requirement behind the queries or prompts, and further conduct accurate analysis and decision. For example, a user needs movie recommendation, a semantic analysis model constructed by a daily relevant corpus of the user is used for carrying out vector retrieval on relevant data such as user preference, state, movies and the like according to a question of the user, submitting the relevant data to a large language model for processing response, and simultaneously providing various data AI AGENT (intelligent AGENT) for deeper mining on user data, for example, an information security intelligent AGENT can give a security report and repair suggestion through detecting all data of the user acquired in real time, and find whether the current behavior of the user is abnormal or not, find whether the user is suffering from fraud and the like through interface text dialogue content acquired in real time, and a health management intelligent AGENT gives accurate analysis based on real complete data to the acquired comprehensive analysis of the daily diet, movement, physiology, sleep and the like of the user.
The method has the beneficial effects that:
According to the embodiment, the three-dimensional memory mapping relation model is constructed to store data of the multi-source heterogeneous mobile terminal, a three-dimensional memory mapping relation table for storing non-relational data is obtained, collection and integration of data fragment information of the non-relational multi-source heterogeneous mobile terminal are effectively achieved, comprehensiveness of obtaining the user mobile terminal is greatly improved, field names of the multi-source heterogeneous mobile terminal data are redefined to field indexes corresponding to the non-relational mobile terminal data fields through the three-dimensional memory mapping relation table, vectorization processing is conducted on the multi-source heterogeneous mobile terminal data obtained through the field indexes, vectorization data are obtained to store and obtain a mobile terminal database, structural storage of the multi-source heterogeneous mobile terminal data is effectively achieved, foundation is laid for providing complete data export service for users conveniently, and a mobile terminal digital twin model for realizing real-time sensing processing of a user environment is obtained according to the constructed virtual entity mapping space, so that the user can greatly improve application technology conditions of managing data safety, maintenance data, processing service data and data of the user terminals and the like.
It should be noted that the above embodiments are merely for illustrating the technical solution of the present invention and not for limiting the same, and although the present invention has been described in detail with reference to the above embodiments, it should be understood by those skilled in the art that the technical solution described in the above embodiments can be modified or some or all of the technical features thereof can be equivalently replaced, and the modification or replacement does not make the essence of the corresponding technical solution deviate from the scope of the technical solution of the embodiments of the present invention.

Claims (7)

Translated fromChinese
1.一种移动终端数字孪生模型构建方法,其特征在于,包括以下步骤:1. A method for constructing a digital twin model of a mobile terminal, comprising the following steps:S1:采集并获取非关系型的多源异构移动终端数据;S1: Collect and obtain non-relational multi-source heterogeneous mobile terminal data;且多源异构移动终端数据包括移动终端应用数据、硬件传感器数据以及用户私人数据;And the multi-source heterogeneous mobile terminal data includes mobile terminal application data, hardware sensor data and user private data;S2:构建三维内存映射关系模型;S2: Construct a three-dimensional memory mapping relationship model;并基于三维内存映射关系模型对多源异构移动终端数据进行数据存储,获取用于存储非关系型数据的三维内存映射关系表,并通过三维内存映射表将多源异构移动终端数据的字段名,重新定义为非关系型移动终端数据字段所对应的字段索引;Based on the three-dimensional memory mapping relationship model, the multi-source heterogeneous mobile terminal data is stored, a three-dimensional memory mapping relationship table for storing non-relational data is obtained, and the field names of the multi-source heterogeneous mobile terminal data are redefined as field indexes corresponding to the non-relational mobile terminal data fields through the three-dimensional memory mapping table;对通过字段索引获取的多源异构移动终端数据进行向量化处理,获取向量化数据以存储并获取移动终端数据库;Vectorize the multi-source heterogeneous mobile terminal data obtained through field indexing, obtain the vectorized data to store and obtain the mobile terminal database;S3:为移动终端数据库提供数据检索服务,以实现移动终端数据库中向量化数据的数据检索操作;S3: Provide data retrieval services for the mobile terminal database to implement data retrieval operations for vectorized data in the mobile terminal database;S4:根据数据检索操作所获得的向量化数据构建虚拟实体映射空间;S4: constructing a virtual entity mapping space based on the vectorized data obtained by the data retrieval operation;定义应用于虚拟实体映射空间的应用智能体,并根据虚拟实体映射空间获取用于实现用户环境实时感知处理执行RPA的移动终端数字孪生模型。Define the application agent applied to the virtual entity mapping space, and obtain the digital twin model of the mobile terminal for realizing real-time perception and processing of the user environment and executing RPA based on the virtual entity mapping space.2.根据权利要求1所述的一种移动终端数字孪生模型构建方法,其特征在于,所述S2具体包括以下步骤:2. According to a method for constructing a digital twin model of a mobile terminal according to claim 1, it is characterized in that S2 specifically comprises the following steps:S21:构建三维内存映射关系模型,其包括数据表设定模块、字段集合获取模块、数据类型指定模块以及三维内存映射关系模块;S21: constructing a three-dimensional memory mapping relationship model, which includes a data table setting module, a field set acquisition module, a data type specifying module and a three-dimensional memory mapping relationship module;数据表设定模块用于根据设定的SQLite数据库,设定用于对多源异构移动终端数据进行数据存储的data数据表;The data table setting module is used to set a data table for storing multi-source heterogeneous mobile terminal data according to the set SQLite database;字段集合获取模块用于对data数据表中存储的多源异构移动终端数据,添加数据字段以获取数据字段集合F;The field set acquisition module is used to add data fields to obtain a data field set F for the multi-source heterogeneous mobile terminal data stored in the data table;且F={data_0,data_1,…,data_n};其中data_n表示添加的第n个数据字段;And F={data_0,data_1,…,data_n}; where data_n represents the nth data field added;数据类型指定模块用于设定数据字段集合中每个数据字段类型的类型标签集合;The data type specifying module is used to set a type tag set for each data field type in the data field set;三维内存映射关系模块用于根据数据字段集合与类型标签集合,构建三维内存映射关系式,并通过三维内存映射关系式获取三维内存映射表;The three-dimensional memory mapping relationship module is used to construct a three-dimensional memory mapping relationship according to a data field set and a type label set, and obtain a three-dimensional memory mapping table through the three-dimensional memory mapping relationship;且三维内存映射关系式为M:F×T type×N→V;其中M表示M(f,t,n)的简写形式;Ttype表示类型标签集合;N表示数据记录的唯一标识符;V表示包含所有需存储的多源异构移动终端数据的数据值;f表示数据字段且f∈F;t表示数据类型且t∈T type;n表示记录标识符对应的值且n∈N;And the three-dimensional memory mapping relationship is M:F×T type×N→V; where M represents the abbreviation of M(f, t, n); Ttype represents the type tag set; N represents the unique identifier of the data record; V represents the data value containing all multi-source heterogeneous mobile terminal data to be stored; f represents the data field and f∈F; t represents the data type and t∈T type; n represents the value corresponding to the record identifier and n∈N;S22:通过获取的三维内存映射表将多源异构移动终端数据的字段名,重新定义为非关系型移动终端数据字段所对应的字段索引;S22: redefine the field names of the multi-source heterogeneous mobile terminal data as field indexes corresponding to the non-relational mobile terminal data fields through the obtained three-dimensional memory mapping table;S23:基于预训练的连续词袋模型CBOW,对通过字段索引获取的多源异构移动终端数据进行向量化处理,获取向量化数据以存储并获取移动终端数据库。S23: Based on the pre-trained continuous bag of words model CBOW, vectorize the multi-source heterogeneous mobile terminal data obtained through field indexing, obtain vectorized data to store and obtain the mobile terminal database.3.根据权利要求1所述的一种移动终端数字孪生模型构建方法,其特征在于,S3中数据检索服务包括基于语义检索的数据检索服务与基于SQL检索的数据检索服务;3. A method for constructing a digital twin model of a mobile terminal according to claim 1, characterized in that the data retrieval service in S3 includes a data retrieval service based on semantic retrieval and a data retrieval service based on SQL retrieval;所述基于语义检索的数据检索服务的检索方法:The retrieval method of the data retrieval service based on semantic retrieval:基于移动终端数据库,通过HNSW图算法构建用于数据检索的若干数据向量索引导航层节点;Based on the mobile terminal database, several data vector index navigation layer nodes for data retrieval are constructed through the HNSW graph algorithm;并基于启发式搜索方法获取数据向量索引导航层节点的边权重;And based on the heuristic search method, the edge weights of the nodes in the data vector index navigation layer are obtained;根据边权重与数据向量索引导航层节点,获取关于数据检索的索引导航图结构;According to the edge weight and data vector index navigation layer nodes, obtain the index navigation graph structure for data retrieval;基于单指令多数据流SIMD的嵌入式向量搜索引擎J Vector,根据索引导航图结构实现对移动终端数据库的数据索引;J Vector, an embedded vector search engine based on SIMD, implements data indexing of mobile terminal databases according to the index navigation graph structure;所述基于SQL检索的数据检索服务的检索方法:The retrieval method of the data retrieval service based on SQL retrieval:通过大语言模型LLM对移动终端数据库的数据进行语义解析,获取结构化查询意图表示;The data in the mobile terminal database is semantically parsed through the large language model (LLM) to obtain structured query intent representation;基于预训练的Text2SQL模型,根据结构化查询意图表示获取SQL语句;Based on the pre-trained Text2SQL model, SQL statements are obtained according to the structured query intent representation;通过本地数据存储方式技术,基于SQL语句根据本地数据库的存储格式与向量化数据的字段索引特征获取SQL查询语句,以根据SQL查询语句实现对移动终端数据库的数据索引。Through local data storage technology, based on SQL statements, SQL query statements are obtained according to the storage format of the local database and the field index characteristics of vectorized data, so as to implement data indexing of the mobile terminal database according to the SQL query statements.4.根据权利要求3所述的一种移动终端数字孪生模型构建方法,其特征在于,所述S4具体包括以下步骤:4. A method for constructing a digital twin model of a mobile terminal according to claim 3, characterized in that S4 specifically comprises the following steps:S41:自定义任意移动终端的实体模型与实体事件;S41: Customize entity models and entity events of any mobile terminal;通过自然语言技术根据移动终端的实体模型与实体事件,自定义构建用于获取结构化语义描述框架的Prompt文本;Based on the entity model and entity events of the mobile terminal, a prompt text for obtaining a structured semantic description framework is customized through natural language technology.S42:调用文生图/文生视频的AIGC模型,根据Prompt文本获取虚拟数字实体模型表征;S42: calling the AIGC model of the Wensheng picture/Wensheng video, and obtaining the virtual digital entity model representation according to the Prompt text;S43:将Prompt文本转换为待查询向量,并基于移动终端数据库提供的数据检索服务,通过待查询向量检索并获取与虚拟数字实体模型表征相关的多源异构移动终端数据;S43: converting the prompt text into a vector to be queried, and based on a data retrieval service provided by a mobile terminal database, retrieving and acquiring multi-source heterogeneous mobile terminal data related to the representation of the virtual digital entity model through the vector to be queried;并结合预训练的大语言模型LLM,根据检索并获取的多源异构移动终端数据获取实体事件的总览报告,所述实体事件的总览报告即为用户所处的虚拟实体映射空间;In combination with the pre-trained large language model LLM, an overview report of entity events is obtained based on the retrieved and acquired multi-source heterogeneous mobile terminal data. The overview report of entity events is the virtual entity mapping space where the user is located;S44:定义应用于虚拟实体映射空间的应用智能体,并基于虚拟实体映射空间检索并获取的实体事件的总览报告,确定在当前用户环境下某一场景的执行动作指令;S44: defining an application agent applied to the virtual entity mapping space, and determining an execution action instruction for a certain scene in the current user environment based on an overview report of entity events retrieved and obtained by the virtual entity mapping space;S45:根据所述执行动作指令进行移动终端的动作执行,以实现用户环境实时感知处理执行RPA的功能,进而实现移动终端数字孪生模型的构建;S45: Execute the action of the mobile terminal according to the execution action instruction to realize the function of real-time perception and processing of the user environment and execution of the RPA, thereby realizing the construction of the digital twin model of the mobile terminal;且执行动作指令至少包括对用户移动端界面的操作指令与对远程服务端接口的访问指令。And the execution action instruction at least includes an operation instruction for the user mobile terminal interface and an access instruction for the remote server terminal interface.5.根据权利要求4所述的一种移动终端数字孪生模型构建方法,其特征在于,S1中对移动终端应用数据采集并获取的方法,包括基于无障碍服务的应用数据主动采集方法与应用数据被动采集方法;5. A method for constructing a digital twin model of a mobile terminal according to claim 4, characterized in that the method for collecting and acquiring application data of the mobile terminal in S1 includes an active application data collection method based on barrier-free services and a passive application data collection method;所述应用数据主动采集方法:The active application data collection method:S001:开启安卓系统的无障碍服务的权限:S001: Enable the accessibility service permissions of the Android system:包括启用允许执行手势操作权限设置用于对移动终端的模拟点击操作;Including enabling the permission setting of allowing gesture operations to be performed for simulating click operations on the mobile terminal;启用允许遍历窗口内容权限设置用于获取界面数据;Enable the permission setting of allowing traversal of window contents to obtain interface data;S002:开启对移动终端应用数据进行采集的悬浮窗;S002: Open a floating window for collecting mobile terminal application data;S003:调用后台服务器下载应用数据采集插件,以基于模拟点击操作根据悬浮窗所在位置采集并获取移动终端的应用数据的结构化数据;S003: calling the background server to download the application data collection plug-in, so as to collect and obtain the structured data of the application data of the mobile terminal according to the location of the floating window based on the simulated click operation;所述应用数据被动采集方法:The passive collection method of application data:S100:开启安卓系统的无障碍服务的权限:S100: Enable the accessibility service permissions of the Android system:包括启用允许执行手势操作权限设置用于对移动终端的模拟点击操作;Including enabling the permission setting of allowing gesture operations to be performed for simulating click operations on the mobile terminal;启用允许遍历窗口内容权限设置用于获取界面数据;Enable the permission setting of allowing traversal of window contents to obtain interface data;S101:通过调用无障碍服务监听事件工具实时监听移动终端页面的变化,且当页面发生变化时,通过调用界面活动窗口节点获取工具实现对移动终端页面实时展示的结构化数据进行采集;否则,不执行采集动作;S101: monitoring the changes of the mobile terminal page in real time by calling the accessibility service monitoring event tool, and when the page changes, collecting the structured data displayed in real time on the mobile terminal page by calling the interface active window node acquisition tool; otherwise, no collection action is performed;调用无障碍服务监听事件工具实时监听移动终端页面变化的方法:Method to call the accessibility service event monitoring tool to monitor the changes of mobile terminal pages in real time:S1011:获取移动终端接收到界面点击事件时的点击时刻,并作为第一点击时刻;S1011: Acquire the click time when the mobile terminal receives the interface click event, and use it as the first click time;获取移动终端接收到界面点击事件时的第二点击时刻,且第二点击时刻为在第一点击时刻对应的界面点击事件之后的界面点击事件对应的点击时刻;Acquire a second click time when the mobile terminal receives the interface click event, where the second click time is the click time corresponding to the interface click event after the interface click event corresponding to the first click time;S1012:获取第二点击时刻与第一点击时刻之间的点击时间间隔;S1012: Obtaining a click time interval between the second click moment and the first click moment;判断点击时间间隔与预设时间间隔阈值之间的大小;Determine the size of the click time interval and the preset time interval threshold;若点击时间间隔大于预设时间间隔阈值,则确认移动终端页面的发生变化;若点击时间间隔小于或等于预设时间间隔阈值,则确认移动终端页面的未发生变化。If the click time interval is greater than the preset time interval threshold, it is confirmed that the mobile terminal page has changed; if the click time interval is less than or equal to the preset time interval threshold, it is confirmed that the mobile terminal page has not changed.6.根据权利要求1所述的一种移动终端数字孪生模型构建方法,其特征在于,S1中还包括对多源异构移动终端数据进行数据压缩的方法:6. A method for constructing a digital twin model of a mobile terminal according to claim 1, characterized in that S1 also includes a method for compressing multi-source heterogeneous mobile terminal data:基于Huffman编码对多源异构移动终端数据中的信源符号进行编码,获取信源编码符号;Encode the source symbols in the multi-source heterogeneous mobile terminal data based on Huffman coding to obtain source coding symbols;获取各信源编码符号的出现概率,并通过基于信源的概率统计模型根据信源编码符号对多源异构移动终端数据进行数据压缩;Obtain the occurrence probability of each source coding symbol, and perform data compression on multi-source heterogeneous mobile terminal data according to the source coding symbols through a source-based probability statistical model;且基于信源的概率统计模型的表达式为And the expression of the probability statistical model based on the information source is ,式中:表示数据压缩的期望长度;表示平均码长;表示编码长度;E表示期望;表示对所有可能的信源编码符号进行求和的描述;Ξ表示信源编码符号的集合;表示信源编码符号出现的概率;表示信源编码符号对应的编码长度。Where: Indicates the expected length of data compression; represents the average code length; Indicates the encoding length; E indicates expectation; Represents all possible source coding symbols A description of the sum is performed; Ξ represents a set of source coding symbols; Indicates source coding symbol Probability of occurrence; Indicates source coding symbol The corresponding encoding length.7.一种基于权利要求1至6任意一项所述的移动终端数字孪生模型构建方法的系统,其特征在于,包括数据获取模块、数据压缩模块、三维内存映射关系模型构建模块、数据向量化模块、数据检索服务模块、应用智能体设定模块、移动终端数字孪生模型构建模块;7. A system based on the method for constructing a digital twin model of a mobile terminal according to any one of claims 1 to 6, characterized in that it comprises a data acquisition module, a data compression module, a three-dimensional memory mapping relationship model construction module, a data vectorization module, a data retrieval service module, an application agent setting module, and a mobile terminal digital twin model construction module;所述数据获取模块用于采集并获取非关系型的多源异构移动终端数据;The data acquisition module is used to collect and acquire non-relational multi-source heterogeneous mobile terminal data;且多源异构移动终端数据包括移动终端应用数据、硬件传感器数据以及用户数据;And the multi-source heterogeneous mobile terminal data includes mobile terminal application data, hardware sensor data and user data;所述数据压缩模块用于对多源异构移动终端数据进行数据压缩,获取压缩数据;The data compression module is used to compress multi-source heterogeneous mobile terminal data to obtain compressed data;所述三维内存映射关系模型构建模块用于构建三维内存映射关系模型;The three-dimensional memory mapping relationship model construction module is used to construct a three-dimensional memory mapping relationship model;并基于三维内存映射关系模型对多源异构移动终端数据进行数据存储,获取用于存储非关系型数据的三维内存映射关系表,并通过三维内存映射表将多源异构移动终端数据的字段名,重新定义为非关系型移动终端数据字段所对应的字段索引;Based on the three-dimensional memory mapping relationship model, the multi-source heterogeneous mobile terminal data is stored, a three-dimensional memory mapping relationship table for storing non-relational data is obtained, and the field names of the multi-source heterogeneous mobile terminal data are redefined as field indexes corresponding to the non-relational mobile terminal data fields through the three-dimensional memory mapping table;所述数据向量化模块用于对通过字段索引获取的多源异构移动终端数据进行向量化处理,获取向量化数据以存储并获取移动终端数据库;The data vectorization module is used to perform vectorization processing on multi-source heterogeneous mobile terminal data obtained through field indexing, obtain vectorized data to store and obtain a mobile terminal database;所述数据检索服务模块用于对移动终端数据库提供数据检索服务,以实现移动终端数据库中向量化数据的数据检索操作;The data retrieval service module is used to provide data retrieval services to the mobile terminal database to implement data retrieval operations on vectorized data in the mobile terminal database;且数据检索服务包括基于语义检索的数据检索服务与基于SQL检索的数据检索服务;And the data retrieval service includes a data retrieval service based on semantic retrieval and a data retrieval service based on SQL retrieval;所述应用智能体设定模块用于设定应用于虚拟实体映射空间的应用智能体,且应用智能体至少包括信息安全智能体与健康管理智能体;The application agent setting module is used to set the application agent applied to the virtual entity mapping space, and the application agent at least includes an information security agent and a health management agent;所述移动终端数字孪生模型构建模块用于根据数据检索操作所获取的向量化数据构建虚拟实体映射空间,并基于应用智能体根据虚拟实体映射空间,获取用于实现用户环境实时感知处理执行RPA的移动终端数字孪生模型。The mobile terminal digital twin model construction module is used to construct a virtual entity mapping space based on the vectorized data obtained by the data retrieval operation, and based on the application agent, obtain a mobile terminal digital twin model for realizing real-time perception of the user environment and executing RPA according to the virtual entity mapping space.
CN202510632025.1A2025-05-162025-05-16 A method and system for building a digital twin model of a mobile terminalActiveCN120144595B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202510632025.1ACN120144595B (en)2025-05-162025-05-16 A method and system for building a digital twin model of a mobile terminal

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202510632025.1ACN120144595B (en)2025-05-162025-05-16 A method and system for building a digital twin model of a mobile terminal

Publications (2)

Publication NumberPublication Date
CN120144595Atrue CN120144595A (en)2025-06-13
CN120144595B CN120144595B (en)2025-08-29

Family

ID=95955680

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202510632025.1AActiveCN120144595B (en)2025-05-162025-05-16 A method and system for building a digital twin model of a mobile terminal

Country Status (1)

CountryLink
CN (1)CN120144595B (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20140129493A1 (en)*2012-10-112014-05-08Orboros, Inc.Method and System for Visualizing Complex Data via a Multi-Agent Query Engine
CN115115796A (en)*2022-07-042022-09-27上海交通大学Building digital twin frame based on semantic Web technology and modeling method
US20230108774A1 (en)*2020-02-272023-04-06Andrew J. ArcherAI Augmented Digital Platform And User Interface
CN118827411A (en)*2024-09-192024-10-22杭州亚信软件有限公司 A method and related device for constructing a digital twin network based on a large language model
US12135788B1 (en)*2021-07-302024-11-05Splunk Inc.Generating information technology incident risk score narratives
CN119026141A (en)*2024-10-252024-11-26大连数晨科技有限公司 A mobile terminal information security detection method
CN119167634A (en)*2024-09-112024-12-20四川中电启明星信息技术有限公司 A digital twin model two-way interactive method, system, device and medium based on data structure table
CN119580976A (en)*2024-07-312025-03-07上海联影智元医疗科技有限公司 A hospital support platform
CN119623025A (en)*2024-11-202025-03-14天津职业技术师范大学(中国职业培训指导教师进修中心) Automatic sorting digital twin system and construction method

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20140129493A1 (en)*2012-10-112014-05-08Orboros, Inc.Method and System for Visualizing Complex Data via a Multi-Agent Query Engine
US20230108774A1 (en)*2020-02-272023-04-06Andrew J. ArcherAI Augmented Digital Platform And User Interface
US12135788B1 (en)*2021-07-302024-11-05Splunk Inc.Generating information technology incident risk score narratives
CN115115796A (en)*2022-07-042022-09-27上海交通大学Building digital twin frame based on semantic Web technology and modeling method
CN119580976A (en)*2024-07-312025-03-07上海联影智元医疗科技有限公司 A hospital support platform
CN119167634A (en)*2024-09-112024-12-20四川中电启明星信息技术有限公司 A digital twin model two-way interactive method, system, device and medium based on data structure table
CN118827411A (en)*2024-09-192024-10-22杭州亚信软件有限公司 A method and related device for constructing a digital twin network based on a large language model
CN119026141A (en)*2024-10-252024-11-26大连数晨科技有限公司 A mobile terminal information security detection method
CN119623025A (en)*2024-11-202025-03-14天津职业技术师范大学(中国职业培训指导教师进修中心) Automatic sorting digital twin system and construction method

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
JAMES JIE PAN等: "Survey of vector database management systems", THE VLDB JOURNAL, 15 July 2024 (2024-07-15), pages 1591 - 1612*
STEFANO CESCO: "Smart agriculture and digital twins: Applications and challenges in a vision of sustainability", EUROPEAN JOURNAL OF AGRONOMY, 31 May 2023 (2023-05-31)*
万力勇;: "融入数字孪生的高校创客空间:意蕴、框架与功能――从物理空间、混合空间到映射空间之演进", 远程教育杂志, no. 03, 20 May 2020 (2020-05-20)*
周勇: "基于数字孪生的烟草物流中心拣选车间监测系统开发及应用研究", 中国优秀硕士学位论文全文数据库, 15 May 2025 (2025-05-15)*

Also Published As

Publication numberPublication date
CN120144595B (en)2025-08-29

Similar Documents

PublicationPublication DateTitle
JP2023502827A (en) How to acquire geographic knowledge
US10924551B2 (en)IRC-Infoid data standardization for use in a plurality of mobile applications
CN108292323A (en)Use the database manipulation of the metadata of data source
CN101799835A (en)Ontology-driven geographic information retrieval system and method
CN117332069B (en) Intelligent extraction method and system for building information model based on large language model
CN105786897B (en)For providing the context aware body constructing method for paying close attention to information of the user based on context aware
CN115269877A (en)Method, system and equipment for constructing domain entity and event double-center knowledge graph
CN117194763A (en)Method for recommending next POI based on user preference and space-time context information
Li et al.Spatio-temporal data fusion techniques for modeling digital twin City
CN118536950A (en)BIM and large model-based railway engineering consultation system and consultation method
CN119357408A (en) A method for constructing electric power knowledge graph based on large language model
CN105354339B (en)Content personalization providing method based on context
CN118013825A (en) Multi-level and multi-type urban space-time collaborative design methods, systems, terminals and media
CN119443116A (en) A multi-dimensional fusion of metaverse and vertical AI model collaborative innovation platform
CN119397038B (en) A method, system, device and storage medium for automatically constructing industrial knowledge graph
CN115827797A (en)Environmental data analysis and integration method and system based on big data
CN120144595B (en) A method and system for building a digital twin model of a mobile terminal
CN117056459B (en)Vector recall method and device
CN111814812B (en) Modeling method, device, storage medium, electronic device and scene recognition method
CN111638926A (en)Method for realizing artificial intelligence in Django framework
CN115392229A (en) A Text Prediction Method Based on Data Lake
Amja et al.Modeling and reasoning in context-aware systems based on relational concept analysis and description logic
Wang[Retracted] Design of an Integrated System of Psychological Health Education Resources for Vocational Students Using Fuzzy Clustering Algorithm Analysis
CN113868322A (en) A semantic structure analysis method, device, device, virtualization system, and medium
CN115495471B (en) Method and related device for predicting HQL execution statements based on user characteristics

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp