技术领域technical field
本发明涉及文档转换应用技术领域,特别是涉及一种网页转换方法及装置。The invention relates to the technical field of document conversion applications, in particular to a web page conversion method and device.
背景技术Background technique
在用户的日常学习及工作中,可以通过浏览网页的方式来获得所需要的信息,针对比较重要的网页,用户常需要将其存储到本地。In the user's daily study and work, the required information can be obtained by browsing webpages, and for more important webpages, the user often needs to store them locally.
现有技术中,用户可以采用“粘贴、复制”的方式,将网页中的网页元素拷贝到本地文档进行存储。In the prior art, the user can use the "paste and copy" method to copy the web page elements in the web page to a local file for storage.
然而,这种“粘贴、复制”的方式需要用户进行多次操作,较为繁琐,无法将网页快速保存在本地。However, this method of "pasting and copying" requires the user to perform multiple operations, which is cumbersome and cannot quickly save the webpage locally.
发明内容Contents of the invention
本发明实施例的目的在于提供一种网页转换方法及装置,以实现将所浏览的重要网页自动快速地保存至本地文档。The purpose of the embodiments of the present invention is to provide a method and device for converting webpages, so as to automatically and quickly save important webpages browsed to local files.
为达到上述目的,本发明实施例公开了一种网页转换方法,包括步骤:In order to achieve the above object, the embodiment of the present invention discloses a web page conversion method, comprising steps:
接收用户对待转换网页的转换操作;Receive the user's conversion operation on the webpage to be converted;
根据所述转换操作对所述待转换网页进行解析,获得解析结果,所述解析结果至少包括:所述待转换网页中的网页元素的网页元素类型;Analyzing the webpage to be converted according to the conversion operation to obtain a parsing result, the parsing result at least including: a webpage element type of a webpage element in the webpage to be converted;
确定进行转换后得到的目标文档的文档格式;Determine the document format of the target document obtained after conversion;
根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中。According to the preset type correspondence between the document element type of the target document and the web page element type, the web page elements in the to-be-converted web page are converted into document elements corresponding to the types, and stored in the document in the determined document format.
较佳的,所述解析结果,还包括:所述待转换网页中的网页元素的坐标;Preferably, the parsing result also includes: the coordinates of the webpage elements in the webpage to be converted;
在所述根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中之前,所述方法还包括:根据所述解析结果中的网页元素的坐标确定网页元素在所述待转换网页中的相对位置;According to the preset type correspondence between the document element type of the target document and the webpage element type, the webpage element in the webpage to be converted is converted into a document element corresponding to the type, and saved in a document in the determined document format Before, the method further includes: determining the relative position of the webpage element in the webpage to be converted according to the coordinates of the webpage element in the analysis result;
所述根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中,包括:According to the preset type correspondence between the document element type of the target document and the webpage element type, the webpage element in the webpage to be converted is converted into a document element corresponding to the type, and stored in the document in the determined document format ,include:
根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素;converting the webpage elements in the webpage to be converted into document elements corresponding to the type according to the preset type correspondence between the document element type of the target document and the webpage element type;
对各网页元素:将该网页元素在所述待转换网页中的相对位置确定为该网页元素转换得到的文档元素在文档页面中的相对位置;For each web page element: determining the relative position of the web page element in the web page to be converted as the relative position of the document element converted from the web page element in the document page;
对各网页元素:将该网页元素设置于所确定的文档格式的文档页面中的所确定的相对位置中,并进行保存。For each webpage element: setting the webpage element at the determined relative position on the document page in the determined document format, and saving it.
较佳的,在所述解析结果仅包括:所述待转换网页中的网页元素的网页元素类型时,所述根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中,包括:Preferably, when the parsing result only includes: the webpage element type of the webpage element in the webpage to be converted, according to the preset type correspondence between the document element type of the target document and the webpage element type, the The webpage elements in the webpage to be converted are converted into document elements corresponding to the type, and stored in the document in the determined document format, including:
根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素;converting the webpage elements in the webpage to be converted into document elements corresponding to the type according to the preset type correspondence between the document element type of the target document and the webpage element type;
按照网页元素读取顺序,将各网页元素转换得到的文档元素依次逐行排列在目标文档中,并进行保存。According to the reading order of the web page elements, the document elements converted from each web page element are arranged line by line in the target document and saved.
较佳的,所述确定进行转换后得到的目标文档的文档格式,包括:Preferably, said determining the document format of the target document obtained after conversion includes:
接收用户的文档格式选取操作;Receive the user's document format selection operation;
将用户所选取的文档格式确定进行转换后得到的目标文档的文档格式。The document format of the target document obtained after conversion is determined by the document format selected by the user.
较佳的,所述确定进行转换后得到的目标文档的文档格式,包括:Preferably, said determining the document format of the target document obtained after conversion includes:
根据所述解析结果中的网页元素类型,确定进行转换后得到的目标文档的文档格式。According to the webpage element type in the parsing result, the document format of the target document obtained after conversion is determined.
较佳的,所述根据所述解析结果中的网页元素类型,确定进行转换后得到的目标文档的文档格式,包括:Preferably, determining the document format of the converted target document according to the web page element type in the parsing result includes:
判断所述网页元素类型中是否具有多媒体元素,如果是,确定进行转换后得到的目标文档的文档格式为演示文稿;Judging whether there is a multimedia element in the webpage element type, if so, determining that the document format of the target document obtained after conversion is a presentation;
在判断为否的情况下,进一步判断所述网页元素类型中是否具有表格元素,如果是,确定进行转换后得到的目标文档的文档格式为电子表格;如果否,确定进行转换后得到的目标文档的文档格式为文本文档。In the case of judging as no, further judge whether there is a form element in the web page element type, if yes, determine that the document format of the target document obtained after conversion is a spreadsheet; if not, determine the target document obtained after conversion The document format is a text document.
为达到上述目的,本发明实施例公开了一种网页转换装置,包括:In order to achieve the above purpose, the embodiment of the present invention discloses a web page converting device, comprising:
转换操作接收模块,用于接收用户对待转换网页的转换操作;A conversion operation receiving module, configured to receive a user conversion operation on a webpage to be converted;
解析结果获得模块,用于根据所述转换操作对所述待转换网页进行解析,获得解析结果,所述解析结果至少包括:所述待转换网页中的网页元素的网页元素类型;An analysis result obtaining module, configured to analyze the webpage to be converted according to the conversion operation, and obtain an analysis result, the analysis result at least including: the webpage element type of the webpage element in the webpage to be converted;
文档格式确定模块,用于确定进行转换后得到的目标文档的文档格式;A document format determination module, configured to determine the document format of the target document obtained after conversion;
网页转换模块,用于根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中。A webpage conversion module, configured to convert the webpage elements in the webpage to be converted into document elements corresponding to the type according to the corresponding relationship between the document element type of the target document and the preset type of webpage element type, and save it in the determined document format document.
较佳的,所述解析结果获得模块所获得的解析结果还包括:所述待转换网页中的网页元素的坐标,所述装置还包括:第一相对位置确定模块;Preferably, the analysis result obtained by the analysis result obtaining module further includes: the coordinates of the webpage elements in the webpage to be converted, and the device further includes: a first relative position determination module;
所述第一相对位置确定模块,用于在所述网页转换模块根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中之前,根据所述解析结果中的网页元素的坐标确定网页元素在所述待转换网页中的相对位置;The first relative position determination module is used to convert the webpage elements in the webpage to be converted into corresponding types according to the preset type correspondence between the document element type of the target document and the webpage element type in the webpage conversion module document elements, and before saving them in the document in the determined document format, determine the relative position of the webpage elements in the webpage to be converted according to the coordinates of the webpage elements in the parsing result;
所述网页转换模块,包括:网页元素转换子模块、第二相对位置确定子模块和文档保存子模块:The webpage conversion module includes: a webpage element conversion submodule, a second relative position determination submodule and a document preservation submodule:
所述网页元素转换子模块,用于根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素;The webpage element conversion submodule is used to convert the webpage elements in the webpage to be converted into corresponding document elements according to the corresponding relationship between the document element type of the target document and the preset type of webpage element type;
所述第二相对位置确定子模块,用于将所述第一相对位置确定模块所确定的各网页元素在所述待转换网页中的相对位置确定为该网页元素转换得到的文档元素在文档页面中的相对位置;The second relative position determining submodule is configured to determine the relative position of each web page element determined by the first relative position determining module in the web page to be converted as the document element converted from the web page element on the document page relative position in
所述文档保存子模块,用于将各网页元素设置于所述第二相对位置确定模块所确定的文档格式的文档页面中的所确定的相对位置中,并进行保存。The document saving sub-module is configured to set and save each webpage element at the determined relative position on the document page in the document format determined by the second relative position determining module.
较佳的,所述解析结果获得模块所获得的解析结果仅包括网页元素类型,所述网页转换模块,包括:网页元素转换子模块和元素位置排列子模块:Preferably, the parsing result obtained by the parsing result obtaining module only includes web page element types, and the web page conversion module includes: a web page element conversion submodule and an element position arrangement submodule:
所述网页元素转换子模块,用于根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素;The webpage element conversion submodule is used to convert the webpage elements in the webpage to be converted into corresponding document elements according to the corresponding relationship between the document element type of the target document and the preset type of webpage element type;
所述元素位置排列子模块,用于按照网页元素读取顺序,将各网页元素转换得到的文档元素依次逐行排列在目标文档中,并进行保存。The element position arranging submodule is used for arranging the document elements converted from each web page element row by row in the target document according to the reading order of the web page elements, and saving them.
较佳的,所述文档格式确定模块,包括:格式选取操作接收子模块和格式确定子模块;Preferably, the document format determination module includes: a format selection operation receiving submodule and a format determination submodule;
所述格式选取操作接收子模块,用于接收用户的文档格式选取操作;The format selection operation receiving submodule is used to receive the user's document format selection operation;
所述目标文档格式手动确定子模块,用于将用户所选取的文档格式确定进行转换后得到的目标文档的文档格式。The target document format manual determination sub-module is used to determine the document format selected by the user and convert the document format of the target document to be obtained.
较佳的,所述文档格式确定模块,具体用于:根据所述解析结果中的网页元素类型,确定进行转换后得到的目标文档的文档格式。Preferably, the document format determination module is specifically configured to: determine the document format of the target document obtained after conversion according to the webpage element type in the parsing result.
较佳的,所述文档格式确定模块,包括:第一格式确定子模块和第二格式确定子模块;Preferably, the document format determination module includes: a first format determination submodule and a second format determination submodule;
所述第一格式确定子模块,用于判断所述网页元素类型中是否具有多媒体元素,如果是,则触发演示文稿确定子模块,否则触发第二格式确定子模块;The first format determination submodule is used to judge whether there is a multimedia element in the webpage element type, if so, trigger the presentation determination submodule, otherwise trigger the second format determination submodule;
所述演示文稿确定子模块,用于确定进行转换后得到的目标文档的文档格式为演示文稿;The presentation determining submodule is used to determine that the document format of the converted target document is a presentation;
所述第二格式确定子模块,用于判断所述网页元素类型中是否具有表格元素,如果是,则触发表格确定子模块,否则触发文本文档确定子模块;The second format determination submodule is used to judge whether there is a form element in the webpage element type, if yes, trigger the form determination submodule, otherwise trigger the text document determination submodule;
所述表格确定子模块,用于确定进行转换后得到的目标文档的文档格式为电子表格;The form determination submodule is used to determine that the document format of the target document obtained after conversion is an electronic form;
所述文本文档确定子模块,用于确定进行转换后得到的目标文档的文档格式为文本文档。The text document determining submodule is used to determine that the document format of the target document obtained after conversion is a text document.
本发明实施例提供的一种网页转换方法及装置,可以对待转换网页进行解析并得到解析结果,文档格式确定模块根据得到的解析结果或用户的文档格式确定操作,确定转换后的目标文档的文档格式,根据所确定的文档格式,网页转换模块将解析结果中的网页元素转换为在目标文档中的文档元素,并自动保存至本地。由此可见,应用本发明实施例可以直接对待转换的网页进行处理,无需用户手动反复采用粘贴复制的方法将网页本地化,因而能够将用户所浏览的网页内容自动快速地保存至本地文档,用户操作更便捷。A webpage conversion method and device provided by the embodiments of the present invention can analyze the webpage to be converted and obtain the analysis result, and the document format determination module determines the document of the converted target document according to the obtained analysis result or the user's document format determination operation format, according to the determined document format, the web page conversion module converts the web page elements in the parsing result into document elements in the target document, and automatically saves them locally. It can be seen that the application of the embodiment of the present invention can directly process the webpage to be converted, without the need for the user to manually use the method of pasting and copying to localize the webpage, so that the content of the webpage browsed by the user can be automatically and quickly saved to the local file, and the user The operation is more convenient.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments of the present invention. Those skilled in the art can also obtain other drawings based on these drawings without creative work.
图1为本发明实施例提供的一种网页转换方法的一种流程示意图;FIG. 1 is a schematic flow chart of a web page conversion method provided by an embodiment of the present invention;
图2为本发明实施例提供的一种网页转换方法的另一种流程示意图;Fig. 2 is another schematic flow chart of a webpage conversion method provided by an embodiment of the present invention;
图3为本发明实施例提供的一种网页转换方法的另一种流程示意图;Fig. 3 is another schematic flow chart of a webpage conversion method provided by an embodiment of the present invention;
图4为本发明实施例提供的一种网页转换方法的另一种流程示意图;FIG. 4 is another schematic flow chart of a webpage conversion method provided by an embodiment of the present invention;
图5为本发明实施例提供的一种网页转换装置的一种结构示意图;FIG. 5 is a schematic structural diagram of a webpage conversion device provided by an embodiment of the present invention;
图6为本发明实施例提供的一种网页转换装置的另一种结构示意图;FIG. 6 is another structural schematic diagram of a webpage conversion device provided by an embodiment of the present invention;
图7为本发明实施例提供的一种网页转换装置的另一种结构示意图;FIG. 7 is another structural schematic diagram of a webpage conversion device provided by an embodiment of the present invention;
图8为本发明实施例提供的一种网页转换方法的另一种结构示意图。FIG. 8 is another structural schematic diagram of a webpage conversion method provided by an embodiment of the present invention.
具体实施方式detailed description
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
首先对本发明实施例提供的一种网页转换方法进行说明,该方法可以包括以下步骤:First, a method for converting a webpage provided by an embodiment of the present invention is described, and the method may include the following steps:
接收用户对待转换网页的转换操作;Receive the user's conversion operation on the webpage to be converted;
根据所述转换操作对所述待转换网页进行解析,获得解析结果,所述解析结果至少包括:所述待转换网页中的网页元素的网页元素类型;Analyzing the webpage to be converted according to the conversion operation to obtain a parsing result, the parsing result at least including: a webpage element type of a webpage element in the webpage to be converted;
确定进行转换后得到的目标文档的文档格式;Determine the document format of the target document obtained after conversion;
根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中。According to the preset type correspondence between the document element type of the target document and the web page element type, the web page elements in the to-be-converted web page are converted into document elements corresponding to the types, and stored in the document in the determined document format.
由此可见,应用本发明实施例可以直接对待转换的网页进行处理,无需用户手动反复采用粘贴复制的方法将网页本地化,因而能够将用户所浏览的网页内容自动快速地保存至本地文档,用户操作更便捷。It can be seen that the application of the embodiment of the present invention can directly process the webpage to be converted, without the need for the user to manually use the method of pasting and copying to localize the webpage, so that the content of the webpage browsed by the user can be automatically and quickly saved to the local file, and the user The operation is more convenient.
下面对本发明实施例所提供的一种网页转换方法的步骤进行详细介绍。The steps of a method for converting a webpage provided by an embodiment of the present invention are described in detail below.
图1为本发明实施例提供的一种网页转换方法的一种流程示意图,该方法可以包括以下步骤:Fig. 1 is a schematic flow chart of a webpage conversion method provided by an embodiment of the present invention, the method may include the following steps:
步骤S101:接收用户对待转换网页的转换操作。Step S101: Receive a user's conversion operation on the webpage to be converted.
其中,接收到的用户对待转换网页的转换操作,可以是针对整页网页内容的转换操作,还可以是针对网页中的部分网页内容的转换操作,用户可以选择待转换的网页内容区域范围。Wherein, the conversion operation received by the user on the webpage to be converted may be a conversion operation on the entire webpage content, or may be a conversion operation on a part of the webpage content in the webpage, and the user may select the range of the content area of the webpage to be converted.
步骤S102:根据所述转换操作对所述待转换网页进行解析,获得解析结果,所述解析结果至少包括:所述待转换网页中的网页元素的网页元素类型。Step S102: Analyzing the to-be-converted webpage according to the conversion operation to obtain a parsing result, the parsing result at least including: webpage element types of webpage elements in the to-be-converted webpage.
在本发明的一种具体实施例中,该解析结果可以包括:In a specific embodiment of the present invention, the parsing result may include:
待转换网页中的网页元素的网页元素类型和待转换网页中的网页元素的坐标。The webpage element type of the webpage element in the webpage to be converted and the coordinates of the webpage element in the webpage to be converted.
需要说明的是,这里所说的待转换网页中的网页元素的坐标,可以在生成网页时的CSS网页样式文档中解析得到。It should be noted that the coordinates of the webpage elements in the webpage to be converted mentioned here can be obtained by analyzing the CSS webpage style document when generating the webpage.
在本发明的另一种具体实施例中,该解析结果可以包括:In another specific embodiment of the present invention, the analysis result may include:
待转换网页中的网页元素的网页元素类型。The web element type of the web element in the web page to be transformed.
需要说明的是,对于所获得解析结果仅有待转换网页中的网页元素的网页元素类型的情况,通常是没有CSS网页样式文档与之对应的,对于这种情况,需要按照网页元素的读取顺序设置转换后的文档元素在转换后的目标文档中的位置。It should be noted that, for the case where the obtained parsing result is only the webpage element type of the webpage element to be converted, usually there is no CSS webpage style document corresponding to it. In this case, the reading order of the webpage elements needs to be followed. Sets the position of the transformed document element in the transformed target document.
步骤S103:确定进行转换后得到的目标文档的文档格式。Step S103: Determine the document format of the target document obtained after conversion.
在实际应用中,确定进行转换后得到的目标文档的文档格式,可以包括以下两种方式:In practical applications, determining the document format of the target document obtained after conversion may include the following two methods:
方式一:method one:
接收用户的文档格式选取操作;Receive the user's document format selection operation;
将用户所选取的文档格式确定进行转换后得到的目标文档的文档格式。The document format of the target document obtained after conversion is determined by the document format selected by the user.
需要说明的是,采用方式一所确定的进行转换后得到的目标文档的文档格式,对于待转换网页中的网页元素会丢失网页元素的功能。举例而言,如果用户所选取的文档格式为文本文档格式,根据方式一中的方法,确定进行转化后得到的目标文档的文档格式为文本文档格式,假设,待转换网页中的网页元素包括视频元素,那么,将该视频元素转换到目标文档中时视频元素无法播放,丢失了视频元素的功能。It should be noted that, if the document format of the target document obtained after the conversion determined by the first method is adopted, the function of the web page element will be lost for the web page element in the web page to be converted. For example, if the document format selected by the user is a text document format, according to the method in the first method, it is determined that the document format of the target document obtained after conversion is a text document format, assuming that the webpage elements in the webpage to be converted include video element, then the video element cannot be played when the video element is converted into the target document, and the function of the video element is lost.
方式二:Method 2:
根据解析结果中的网页元素类型,确定进行转换后得到的目标文档的文档格式。According to the element type of the web page in the parsing result, the document format of the target document obtained after conversion is determined.
参见图2,步骤S103,可以包括:Referring to Fig. 2, step S103 may include:
步骤S103a:判断网页元素类型中是否具有多媒体元素,如果是,执行步骤S103c,否则执行步骤S103b;Step S103a: determine whether there is a multimedia element in the element type of the web page, if yes, execute step S103c, otherwise execute step S103b;
步骤S103c:确定进行转换后得到的目标文档的文档格式为演示文稿;Step S103c: determining that the document format of the target document obtained after the conversion is a presentation;
步骤S103b:进一步判断网页元素类型中是否具有表格元素,如果是,执行步骤S103d,否则执行步骤S103e;Step S103b: further judge whether there is a table element in the element type of the web page, if yes, execute step S103d, otherwise execute step S103e;
步骤S103d:确定进行转换后得到的目标文档的文档格式为电子表格;Step S103d: Determine that the document format of the target document obtained after the conversion is a spreadsheet;
步骤S103e:确定进行转换后得到的目标文档的文档格式为文本文档。Step S103e: Determine that the document format of the converted target document is a text document.
需要说明的是,采用方式二所确定的进行转换后得到的目标文档的文档格式,能够尽可能地保留网页元素的功能。It should be noted that, adopting the document format of the converted target document determined in the second manner can retain the functions of the web page elements as much as possible.
在实际应用中,具体采用何种方式来确定待转换后得到的目标文档的文档格式还需结合实际情况而定,在对于有保留网页元素功能要求的情况下,可以采用方式二来确定;在没有保留网页元素功能需求或者用户有特殊格式要求的情况下,可以采用方式一来确定。In practical applications, the specific method to be used to determine the document format of the target document to be converted depends on the actual situation. In the case of retaining the function of web page elements, the second method can be used to determine; If the functional requirements of web page elements are not reserved or the user has special format requirements, method 1 can be used to determine.
步骤S104:根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中。Step S104: According to the preset type correspondence between the document element type of the target document and the webpage element type, convert the webpage element in the webpage to be converted into a document element corresponding to the type, and save the document in the determined document format middle.
针对步骤S102中的一种具体实施例,上述解析结果还可以在获得待转换网页中的网页元素的网页元素类型的基础上,还获得待转换网页中的网页元素的坐标。For a specific embodiment in step S102, the above parsing result may also obtain the coordinates of the webpage element in the webpage to be converted on the basis of obtaining the webpage element type of the webpage element in the webpage to be converted.
在图1所示的步骤S104之前,所述方法还可以包括步骤S105:根据解析结果中的网页元素的坐标确定网页元素在待转换网页中的相对位置;Before step S104 shown in Figure 1, the method may also include step S105: determine the relative position of the webpage element in the webpage to be converted according to the coordinates of the webpage element in the parsing result;
参见图3,所述步骤S104,可以包括:Referring to FIG. 3, the step S104 may include:
步骤S104a:根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将待转换网页中的网页元素转换为类型相对应的文档元素;Step S104a: According to the preset type correspondence between the document element type of the target document and the webpage element type, convert the webpage element in the webpage to be converted into a document element corresponding to the type;
具体的,在确定目标文档的文档格式后,针对不同的网页元素,具有不同的网页元素转换对应关系。Specifically, after the document format of the target document is determined, different webpage element transformation correspondences are established for different webpage elements.
例如,当确定的目标文档的文档格式为演示文稿时,针对音频、视频等多媒体网页元素,首先在待转换网页的解析过程中,将音频、视频等多媒体网页元素的原文件从网页中下载至本地;然后按照演示文稿中添加多媒体文档元素的规则,将下载至本地的多媒体原文件确定为该演示文稿的文档元素,添加至演示文稿的文档中。For example, when the document format of the determined target document is a presentation, for multimedia webpage elements such as audio and video, at first in the parsing process of the webpage to be converted, the original files of multimedia webpage elements such as audio and video are downloaded from the webpage to local; then, according to the rules for adding multimedia document elements in the presentation, the original multimedia file downloaded to the local is determined as the document element of the presentation, and added to the document of the presentation.
例如,当确定的目标文档的文档格式为电子表格时,针对表格元素而言,首先确定网页元素中表格元素的属性(行高、列宽、字体、字号等);然后,按照所述网页元素中的表格属性确定为目标文档的表格属性,按照电子表格中添加表格的规则,将网页元素中的表格元素转换为目标文档中的表格元素。For example, when the document format of the determined target document is a spreadsheet, for the table element, first determine the attributes of the table element in the web page element (row height, column width, font, font size, etc.); then, according to the web page element The form attribute in is determined as the form attribute of the target document, and the form element in the webpage element is converted into the form element in the target document according to the rule of adding a form in the electronic form.
例如,当确定的目标文档的文档格式为文本文档时,针对图片元素而言,首先在待转换网页的解析过程中,将该图片的原文件下载至本地;然后,根据文本文档中的页面宽度对图片进行压缩处理,将压缩后的图片按照文本文档中图片添加规则,将网页元素中的图片元素转换为目标文档中的图片元素。For example, when the document format of the determined target document is a text document, for a picture element, firstly, the original file of the picture is downloaded to the local during the parsing process of the webpage to be converted; then, according to the page width in the text document, The image is compressed, and the compressed image is converted into an image element in the target document according to the image addition rules in the text document.
例如,当确定的目标文档的文档格式为文本文档时,针对超链接元素,首先确定超链接中文字的属性(字体、字号、颜色、链接到的目标地址等);然后,按照文本文档中超链接的添加规则,将网页元素中的超链接元素转换为具有相同属性的文本文档中的超链接元素。For example, when the document format of the determined target document is a text document, for the hyperlink element, at first determine the attributes of the words in the hyperlink (font, font size, color, target address linked to, etc.); Add rules for converting a hyperlink element in a web page element to a hyperlink element in a text document with the same attributes.
例如,当确定的目标文档的文档格式为文本文档时,针对图标元素,首先将图标元素中的图片下载至本地,然后将该图标元素的图片转换为文本文档中的图片元素。需要说明的是,当图标元素转换到目标文档中后,仅保留了该图标的图片,但是该图标在待转换网页中的功能已丢失。For example, when the document format of the determined target document is a text document, for the icon element, the picture in the icon element is first downloaded to the local, and then the picture of the icon element is converted into a picture element in the text document. It should be noted that after the icon element is converted into the target document, only the picture of the icon is retained, but the function of the icon in the webpage to be converted has been lost.
还需要说明的是,当所确定的目标文档的文档格式是由用户的文档格式选取操作所确定的,这种情况下,对于待转换网页中的网页元素可能会出现元素功能丢失,例如,对于包含多媒体网页元素的待转换网页,用户所确定的目标文档的文档格式为文本文档,那么显然,该多媒体元素的播放音视频的功能无法展现在目标文档中,即网页元素的元素功能丢失It should also be noted that when the determined document format of the target document is determined by the user's document format selection operation, in this case, element functions may be lost for webpage elements in the webpage to be converted, for example, for a webpage containing For the webpage to be converted with a multimedia webpage element, the document format of the target document determined by the user is a text document, then obviously, the function of playing audio and video of the multimedia element cannot be displayed in the target document, that is, the element function of the webpage element is lost
步骤S104b:对各网页元素:将该网页元素在所述待转换网页中的相对位置确定为该网页元素转换得到的文档元素在文档页面中的相对位置;Step S104b: For each web page element: determine the relative position of the web page element in the web page to be converted as the relative position of the document element converted from the web page element in the document page;
步骤S104c:对各网页元素:将该网页元素设置于所确定的文档格式的文档页面中的所确定的相对位置中,并进行保存。Step S104c: For each webpage element: set the webpage element at the determined relative position on the document page in the determined document format, and save it.
由此可见,应用本发明实施例可以直接对待转换的网页进行处理,无需用户手动反复采用粘贴复制的方法将网页本地化,因而能够将用户所浏览的网页内容自动快速地保存至本地文档,用户操作更便捷;另外,由于在对待转换网页的解析中还得到了网页元素的相对位置,并将该相对位置确定为文档元素在目标文档中的相对位置,进而设置转换后的网页元素,所以还保留了网页元素在待转换网页中的布局,因此,采用图3所示的具体实施方式使得转换后的目标文档更真实地将待转换网页保存至本地,用户使用更加方便。It can be seen that the application of the embodiment of the present invention can directly process the webpage to be converted, without the need for the user to manually repeatedly use the method of pasting and copying to localize the webpage, so that the content of the webpage browsed by the user can be automatically and quickly saved to the local file, and the user The operation is more convenient; in addition, because the relative position of the web page element is also obtained in the parsing of the web page to be converted, and the relative position is determined as the relative position of the document element in the target document, and then the converted web page element is set, so it is also The layout of webpage elements in the webpage to be converted is preserved. Therefore, the specific implementation shown in FIG. 3 makes the converted target document more realistically save the webpage to be converted locally, making it more convenient for users to use.
针对步骤S102中的另一种具体实施例,解析结果仅包括待转换网页中的网页元素的网页元素类型,参见图4,所述步骤S104,可以包括:For another specific embodiment in step S102, the analysis result only includes the webpage element type of the webpage element in the webpage to be converted, referring to FIG. 4, the step S104 may include:
步骤S104a:根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将所述待转换网页中的网页元素转换为类型相对应的文档元素;Step S104a: According to the preset type correspondence between the document element type of the target document and the webpage element type, convert the webpage element in the webpage to be converted into a document element corresponding to the type;
步骤S104d:按照网页元素读取顺序,将各网页元素转换得到的文档元素依次逐行排列在目标文档中,并进行保存。Step S104d: According to the reading order of the web page elements, the document elements converted from each web page element are arranged line by line in the target document and saved.
基于上述情况可知,在对待转换网页进行转换时,除了按照网页元素的类型进行了元素转换,也对网页元素的位置进行了设置,但是,由于网页元素是按照网页元素读取顺序依次排列的,因而,在对待转换网页进行转换的过程中,并没有保留网页元素在待转换网页中的布局。Based on the above situation, it can be seen that when converting the webpage to be converted, in addition to element conversion according to the type of the webpage element, the position of the webpage element is also set. However, since the webpage elements are arranged in sequence according to the reading order of the webpage elements, Therefore, during the process of converting the webpage to be converted, the layout of the webpage elements in the webpage to be converted is not preserved.
由此可见,应用本发明实施例可以直接对待转换的网页进行处理,无需用户手动反复采用粘贴复制的方法将网页本地化,因而能够将用户所浏览的网页内容自动快速地保存至本地文档,用户操作更便捷。It can be seen that the application of the embodiment of the present invention can directly process the webpage to be converted, without the need for the user to manually use the method of pasting and copying to localize the webpage, so that the content of the webpage browsed by the user can be automatically and quickly saved to the local file, and the user The operation is more convenient.
对应于上述方法实施例,本发明实施例还提供一种网页转换装置,参见图5,该装置可以包括:转换操作接收模块210、解析结果获得模块220、文档格式确定模块230和网页转换模块240;其中,Corresponding to the above-mentioned method embodiment, the embodiment of the present invention also provides a web page conversion device, referring to FIG. 5 , the device may include: a conversion operation receiving module 210, a parsing result obtaining module 220, a document format determining module 230 and a web page conversion module 240 ;in,
转换操作接收模块210,用于接收用户对待转换网页的转换操作。The conversion operation receiving module 210 is configured to receive the user's conversion operation on the webpage to be converted.
解析结果获得模块220,用于根据转换操作对待转换网页进行解析,获得解析结果,解析结果至少包括:待转换网页中的网页元素的网页元素类型。The parsing result obtaining module 220 is configured to parse the webpage to be converted according to the conversion operation, and obtain the parsing result, the parsing result at least includes: the webpage element type of the webpage element in the webpage to be converted.
文档格式确定模块230,用于确定进行转换后得到的目标文档的文档格式。The document format determination module 230 is configured to determine the document format of the target document obtained after conversion.
具体的,在本发明的一种实施例中,文档格式确定模块230,可以用于接收用户的文档格式选取操作;将用户所选取的文档格式确定进行转换后得到的目标文档的文档格式。Specifically, in an embodiment of the present invention, the document format determination module 230 may be configured to receive a user's document format selection operation; determine the document format of the target document obtained after converting the document format selected by the user.
具体的,在本发明的另一种实施方式中,文档格式确定模块230,可以用于:根据解析结果中的网页元素类型,确定进行转换后得到的目标文档的文档格式。Specifically, in another embodiment of the present invention, the document format determination module 230 may be configured to: determine the document format of the target document obtained after conversion according to the type of the webpage element in the parsing result.
参见图6,在本发明的另一种实施例中,文档格式确定模块230,可以包括:第一格式确定子模块230a、第二格式确定子模块230b、演示文稿确定子模块230c、表格确定子模块230d和文本文档确定子模块230e。Referring to Fig. 6, in another embodiment of the present invention, the document format determining module 230 may include: a first format determining submodule 230a, a second format determining submodule 230b, a presentation determining submodule 230c, a form determining submodule Module 230d and text document determination sub-module 230e.
其中,第一格式确定子模块230a,用于判断网页元素类型中是否具有多媒体元素,如果是,则触发演示文稿确定子模块230c,否则触发第二格式确定子模块230b;Wherein, the first format determination submodule 230a is used to judge whether there is a multimedia element in the webpage element type, if yes, then trigger the presentation document determination submodule 230c, otherwise trigger the second format determination submodule 230b;
演示文稿确定子模块230c,用于确定进行转换后得到的目标文档的文档格式为演示文稿;The presentation document determination sub-module 230c is used to determine that the document format of the converted target document is a presentation document;
第二格式确定子模块230b,用于判断网页元素类型中是否具有表格元素,如果是,则触发表格确定子模块230d,否则触发文本文档确定子模块230e;The second format determination sub-module 230b is used to judge whether there is a table element in the webpage element type, if yes, then trigger the form determination sub-module 230d, otherwise trigger the text document determination sub-module 230e;
表格确定子模块230d,用于确定进行转换后得到的目标文档的文档格式为电子表格;The form determination sub-module 230d is used to determine that the document format of the converted target document is an electronic form;
文本文档确定子模块230e,用于确定进行转换后得到的目标文档的文档格式为文本文档。The text document determination sub-module 230e is configured to determine that the document format of the converted target document is a text document.
网页转换模块240,用于根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中。The web page conversion module 240 is configured to convert the web page elements in the web page to be converted into document elements corresponding to the type according to the preset type correspondence between the document element type of the target document and the web page element type, and save them in the determined document format in the document.
在本发明的一种具体实施方式中,解析结果获得模块220所获得的解析结果,除了获得的待转换网页中的网页元素的网页元素类型,还获得:待转换网页中的网页元素的坐标,参见图7,与图3所述的方法对应,在图5所示实施例的基础上,该装置还包括:第一相对位置确定模块250;In a specific implementation of the present invention, the analysis result obtained by the analysis result obtaining module 220, in addition to the obtained webpage element type of the webpage element in the webpage to be converted, also obtains: the coordinates of the webpage element in the webpage to be converted, Referring to FIG. 7, corresponding to the method described in FIG. 3, on the basis of the embodiment shown in FIG. 5, the device further includes: a first relative position determination module 250;
第一相对位置确定模块250,用于在网页转换模块240根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将待转换网页中的网页元素转换为类型相对应的文档元素,并保存于所确定的文档格式的文档中之前,根据解析结果中的网页元素的坐标确定网页元素在待转换网页中的相对位置;The first relative position determination module 250 is used to convert the webpage elements in the webpage to be converted into document elements corresponding to the type according to the preset type correspondence between the document element type of the target document and the webpage element type in the webpage conversion module 240, and before saving in the document in the determined document format, determine the relative position of the webpage element in the webpage to be converted according to the coordinates of the webpage element in the parsing result;
在图7所示实施例中,网页转换模块240,可以包括:网页元素转换子模块240a、第二相对位置确定子模块240b和文档保存子模块240c:In the embodiment shown in FIG. 7, the webpage conversion module 240 may include: a webpage element conversion submodule 240a, a second relative position determination submodule 240b, and a document preservation submodule 240c:
网页元素转换子模块240a,用于根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将待转换网页中的网页元素转换为类型相对应的文档元素;The webpage element conversion sub-module 240a is used to convert the webpage elements in the webpage to be converted into corresponding document elements according to the corresponding relationship between the document element type of the target document and the preset type of webpage element type;
第二相对位置确定子模块240b,用于将第一相对位置确定模块250所确定的各网页元素在待转换网页中的相对位置确定为该网页元素转换得到的文档元素在文档页面中的相对位置;The second relative position determination sub-module 240b is configured to determine the relative position of each web page element determined by the first relative position determination module 250 in the web page to be converted as the relative position of the document element converted from the web page element in the document page ;
文档保存子模块240c,用于将各网页元素设置于第二相对位置确定子模块所确定的文档格式的文档页面中的所确定的相对位置中,并进行保存。The document saving submodule 240c is configured to set and save each web page element at the determined relative position on the document page in the document format determined by the second relative position determining submodule.
应用本发明图7所示实施例可以直接对待转换的网页进行处理,无需用户手动反复采用粘贴复制的方法将网页本地化,因而能够将用户所浏览的网页内容自动快速地保存至本地文档,用户操作更便捷;另外,由于在对待转换网页的解析中还得到了网页元素的相对位置,并将该相对位置确定为文档元素在目标文档中的相对位置,进而设置转换后的网页元素,所以还保留了网页元素在待转换网页中的布局,因此,采用图7所示的具体实施方式使得转换后的目标文档更真实地将待转换网页保存至本地,用户使用更加方便。Applying the embodiment shown in Figure 7 of the present invention can directly process the webpage to be converted, without the need for the user to manually use the method of pasting and copying to localize the webpage, so that the content of the webpage browsed by the user can be automatically and quickly saved to the local file, and the user The operation is more convenient; in addition, because the relative position of the web page element is also obtained in the parsing of the web page to be converted, and the relative position is determined as the relative position of the document element in the target document, and then the converted web page element is set, so it is also The layout of webpage elements in the webpage to be converted is preserved. Therefore, the specific implementation shown in FIG. 7 makes the converted target document more realistically save the webpage to be converted locally, making it more convenient for users to use.
在本发明的另一种具体实施方式中,解析结果获得模块220所获得的解析结果仅包括网页元素类型,参见图8,与图4所述的方法对应,图5所示实施例中的网页转换模块240,可以包括:网页元素转换子模块240a和元素位置排列子模块240d。In another specific implementation of the present invention, the analysis result obtained by the analysis result obtaining module 220 only includes the element type of the webpage, referring to FIG. 8, corresponding to the method described in FIG. 4, the webpage in the embodiment shown in FIG. The converting module 240 may include: a webpage element converting submodule 240a and an element position arranging submodule 240d.
其中,网页元素转换子模块240a,用于根据目标文档的文档元素类型与网页元素类型的预设类型对应关系,将待转换网页中的网页元素转换为类型相对应的文档元素;Wherein, the webpage element conversion sub-module 240a is used to convert the webpage elements in the webpage to be converted into corresponding document elements according to the preset type correspondence between the document element type of the target document and the webpage element type;
元素位置排列子模块240d,用于按照网页元素读取顺序,将各网页元素转换得到的文档元素依次逐行排列在目标文档中,并进行保存。The element position arranging sub-module 240d is used for arranging the document elements converted from each web page element in the target document line by line according to the reading order of the web page elements, and saving them.
由图8所示实施例可知,在对待转换网页进行转换时,除了按照网页元素的类型进行了元素转换,也对网页元素的位置进行了设置,但是,由于网页元素是按照网页元素读取顺序依次排列的,因而,在对待转换网页进行转换的过程中,并没有保留网页元素在待转换网页中的布局。As can be seen from the embodiment shown in Figure 8, when the webpage to be converted is converted, in addition to element conversion according to the type of the webpage element, the position of the webpage element is also set. However, since the webpage elements are read according to the order of webpage elements Therefore, in the process of converting the webpage to be converted, the layout of the webpage elements in the webpage to be converted is not preserved.
由此可见,应用本发明实施例可以直接对待转换的网页进行处理,无需用户手动反复采用粘贴复制的方法将网页本地化,因而能够将用户所浏览的网页内容自动快速地保存至本地文档,用户操作更便捷。It can be seen that the application of the embodiment of the present invention can directly process the webpage to be converted, without the need for the user to manually use the method of pasting and copying to localize the webpage, so that the content of the webpage browsed by the user can be automatically and quickly saved to the local file, and the user The operation is more convenient.
对于系统或装置实施例而言,由于其基本相似于方法实施例,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。As for the system or device embodiments, since they are basically similar to the method embodiments, the description is relatively simple, and for related parts, please refer to part of the description of the method embodiments.
需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that there is a relationship between these entities or operations. There is no such actual relationship or order between them. Furthermore, the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, but also includes elements not expressly listed. other elements of or also include elements inherent in such a process, method, article, or apparatus. Without further limitations, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article or apparatus comprising said element.
本领域普通技术人员可以理解实现上述方法实施方式中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,所述的程序可以存储于计算机可读取存储介质中,这里所称得的存储介质,如:ROM/RAM、磁碟、光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the implementation of the above method can be completed by instructing related hardware through a program, and the program can be stored in a computer-readable storage medium, referred to herein as Storage media, such as: ROM/RAM, disk, CD, etc.
以上所述仅为本发明的较佳实施例而已,并非用于限定本发明的保护范围。凡在本发明的精神和原则之内所作的任何修改、等同替换、改进等,均包含在本发明的保护范围内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the protection scope of the present invention. Any modification, equivalent replacement, improvement, etc. made within the spirit and principles of the present invention are included in the protection scope of the present invention.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510232575.0ACN106202005A (en) | 2015-05-08 | 2015-05-08 | Method and device for web page conversion |
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510232575.0ACN106202005A (en) | 2015-05-08 | 2015-05-08 | Method and device for web page conversion |
| Publication Number | Publication Date |
|---|---|
| CN106202005Atrue CN106202005A (en) | 2016-12-07 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201510232575.0APendingCN106202005A (en) | 2015-05-08 | 2015-05-08 | Method and device for web page conversion |
| Country | Link |
|---|---|
| CN (1) | CN106202005A (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109542554A (en)* | 2018-10-26 | 2019-03-29 | 金蝶软件(中国)有限公司 | Method, apparatus, computer equipment and the storage medium of document layout conversion |
| CN110018984A (en)* | 2017-10-31 | 2019-07-16 | 北京国双科技有限公司 | A kind of conversion method and device of file format |
| CN110956016A (en)* | 2018-09-25 | 2020-04-03 | 珠海金山办公软件有限公司 | Document content format adjusting method and device and electronic equipment |
| CN111125587A (en)* | 2019-12-31 | 2020-05-08 | 北京百度网讯科技有限公司 | Webpage structure optimization method, device, equipment and storage medium |
| CN111125598A (en)* | 2019-12-20 | 2020-05-08 | 深圳壹账通智能科技有限公司 | Intelligent data query method, device, equipment and storage medium |
| CN112528612A (en)* | 2019-08-29 | 2021-03-19 | 小船出海教育科技(北京)有限公司 | Method, device, storage medium and processor for displaying webpage content in document |
| CN114491097A (en)* | 2021-12-20 | 2022-05-13 | 奇安信科技集团股份有限公司 | Method, device and device for converting web page into presentation PPT |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101079059A (en)* | 2007-03-27 | 2007-11-28 | 腾讯科技(深圳)有限公司 | System, method and browser for keeping web page content |
| WO2012076976A1 (en)* | 2010-12-08 | 2012-06-14 | N&N Chopra Consultants Pvt. Ltd. | System and method for integrating software functionalities on n-layer architecture platform |
| CN102737116A (en)* | 2012-05-29 | 2012-10-17 | 深圳市同洲电子股份有限公司 | Method and device for storing webpage resources |
| CN103631795A (en)* | 2012-08-22 | 2014-03-12 | 百度在线网络技术(北京)有限公司 | Method and device for converting webpages in network equipment and equipment |
| CN103870441A (en)* | 2012-12-14 | 2014-06-18 | 苏州精易会信息技术有限公司 | Method for converting webpage table data into Excel |
| CN104077292A (en)* | 2013-03-27 | 2014-10-01 | 腾讯科技(深圳)有限公司 | Webpage information storage method and equipment |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101079059A (en)* | 2007-03-27 | 2007-11-28 | 腾讯科技(深圳)有限公司 | System, method and browser for keeping web page content |
| WO2012076976A1 (en)* | 2010-12-08 | 2012-06-14 | N&N Chopra Consultants Pvt. Ltd. | System and method for integrating software functionalities on n-layer architecture platform |
| CN102737116A (en)* | 2012-05-29 | 2012-10-17 | 深圳市同洲电子股份有限公司 | Method and device for storing webpage resources |
| CN103631795A (en)* | 2012-08-22 | 2014-03-12 | 百度在线网络技术(北京)有限公司 | Method and device for converting webpages in network equipment and equipment |
| CN103870441A (en)* | 2012-12-14 | 2014-06-18 | 苏州精易会信息技术有限公司 | Method for converting webpage table data into Excel |
| CN104077292A (en)* | 2013-03-27 | 2014-10-01 | 腾讯科技(深圳)有限公司 | Webpage information storage method and equipment |
| Title |
|---|
| 杨欣: "《大学计算机基础》", 1 August 2012, 清华大学出版社* |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110018984A (en)* | 2017-10-31 | 2019-07-16 | 北京国双科技有限公司 | A kind of conversion method and device of file format |
| CN110956016A (en)* | 2018-09-25 | 2020-04-03 | 珠海金山办公软件有限公司 | Document content format adjusting method and device and electronic equipment |
| CN110956016B (en)* | 2018-09-25 | 2024-03-22 | 珠海金山办公软件有限公司 | Document content format adjusting method and device and electronic equipment |
| CN109542554A (en)* | 2018-10-26 | 2019-03-29 | 金蝶软件(中国)有限公司 | Method, apparatus, computer equipment and the storage medium of document layout conversion |
| CN112528612A (en)* | 2019-08-29 | 2021-03-19 | 小船出海教育科技(北京)有限公司 | Method, device, storage medium and processor for displaying webpage content in document |
| CN112528612B (en)* | 2019-08-29 | 2024-03-22 | 小船出海教育科技(北京)有限公司 | Method, device, storage medium and processor for displaying webpage content in document |
| CN111125598A (en)* | 2019-12-20 | 2020-05-08 | 深圳壹账通智能科技有限公司 | Intelligent data query method, device, equipment and storage medium |
| CN111125587A (en)* | 2019-12-31 | 2020-05-08 | 北京百度网讯科技有限公司 | Webpage structure optimization method, device, equipment and storage medium |
| CN111125587B (en)* | 2019-12-31 | 2023-08-04 | 北京百度网讯科技有限公司 | Web page structure optimization method, device, equipment and storage medium |
| CN114491097A (en)* | 2021-12-20 | 2022-05-13 | 奇安信科技集团股份有限公司 | Method, device and device for converting web page into presentation PPT |
| Publication | Publication Date | Title |
|---|---|---|
| CN106202005A (en) | Method and device for web page conversion | |
| CN110083805B (en) | A method and system for converting a Word file into an EPUB file | |
| US10013730B2 (en) | Display method and display device | |
| WO2016019805A1 (en) | Method and apparatus for making and displaying expansion content of electronic book | |
| CN104079652B (en) | method for making and playing HTM (hypertext markup language) advertisement file | |
| US8635518B1 (en) | Methods and systems to copy web content selections | |
| CN107451113A (en) | Automatic typesetting method and system for presentation document | |
| KR20060046002A (en) | Method and system for content mapping between launch template and target template | |
| TWI571757B (en) | A webpage edition system and the method thereof and a computer program product for storing a webpage edition program | |
| CN105095160A (en) | Document conversion reading method and system | |
| CN104516861A (en) | Multimedia interactive document processing method | |
| CN108509504A (en) | Document online preview method, device, equipment, client and storage medium | |
| CN104898934A (en) | Method and device for determining automatic page turning time of electronic document | |
| CN103513875A (en) | Method for automatically spanning pages of electronic book | |
| CN105404612A (en) | Digital resource display method and system | |
| CN105630757B (en) | A kind of data editing method and device | |
| CN109032584A (en) | A kind of generation method of cascading style sheets, device, equipment and medium | |
| CN114781327A (en) | Method, device, electronic device and storage medium for processing digital teaching materials | |
| JP2007506387A5 (en) | ||
| CN108319576A (en) | A kind of method and device generating picture materials | |
| CN105786811B (en) | A Method and Device for Obtaining a Slide Layout Page | |
| TWI604319B (en) | Web page information synchronous browsing system and browsing method thereof | |
| CN109509464B (en) | Method and device for recording text reading as audio | |
| CN105867885B (en) | A kind of storage method and device of slide file | |
| CN104750669A (en) | To-be-pasted object processing method and to-be-pasted object processing device |
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| CB02 | Change of applicant information | Address after:Jinshan software building No. 8 Jingshan Hill Road, Lane 519015 Lianshan Jida Zhuhai city in Guangdong Province Applicant after:Zhuhai Kingsoft Office Software Co., Ltd. Applicant after:Beijing Kingsoft office software Limited by Share Ltd Applicant after:GUANGZHOU JINSHAN JINSHAN MOBILE TECHNOLOGY CO., LTD. Address before:Jinshan software building No. 8 Jingshan Hill Road, Lane 519015 Lianshan Jida Zhuhai city in Guangdong Province Applicant before:Zhuhai Kingsoft Office Software Co., Ltd. Applicant before:Beijing Kingsoft WPS Office Co., Ltd. Applicant before:GUANGZHOU JINSHAN JINSHAN MOBILE TECHNOLOGY CO., LTD. | |
| COR | Change of bibliographic data | ||
| RJ01 | Rejection of invention patent application after publication | Application publication date:20161207 | |
| RJ01 | Rejection of invention patent application after publication |