Movatterモバイル変換


[0]ホーム

URL:


CN107276659B - Voice talkback method and device and mobile terminal - Google Patents

Voice talkback method and device and mobile terminal
Download PDF

Info

Publication number
CN107276659B
CN107276659BCN201710439142.1ACN201710439142ACN107276659BCN 107276659 BCN107276659 BCN 107276659BCN 201710439142 ACN201710439142 ACN 201710439142ACN 107276659 BCN107276659 BCN 107276659B
Authority
CN
China
Prior art keywords
voice
file
voice file
server
algorithm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710439142.1A
Other languages
Chinese (zh)
Other versions
CN107276659A (en
Inventor
张国滔
郑勇
魏科文
卫特超
郑培艺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Waterward Information Co Ltd
Original Assignee
Shenzhen Water World Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Water World Co LtdfiledCriticalShenzhen Water World Co Ltd
Priority to CN201710439142.1ApriorityCriticalpatent/CN107276659B/en
Publication of CN107276659ApublicationCriticalpatent/CN107276659A/en
Priority to PCT/CN2017/109187prioritypatent/WO2018227854A1/en
Application grantedgrantedCritical
Publication of CN107276659BpublicationCriticalpatent/CN107276659B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Images

Classifications

Landscapes

Abstract

The invention discloses a voice talkback method, a device and a mobile terminal, wherein the method comprises the following steps: keeping long connection with a server through a satellite mobile communication network; compressing the collected voice information by using a low-rate voice coding algorithm and generating a first voice file; and sending the first voice file to the server so that the server sends the first voice file to a receiving end. The collected voice information is compressed by using the low-rate voice coding algorithm, so that the code rate of the voice information is greatly reduced, the capacity of a sent voice file is reduced, the bandwidth resource of a satellite mobile communication network is saved, and then the low-delay real-time talkback is realized.

Description

Voice talkback method and device and mobile terminal
Technical Field
The present invention relates to the field of communications technologies, and in particular, to a voice intercom method, apparatus, and mobile terminal.
Background
Mobile communication in a regional or global area is called satellite mobile communication by using a geostationary orbit satellite or a medium or low orbit satellite as a relay station. It generally comprises three parts: a communication satellite consisting of one or more satellites; a ground station including a system control center and a plurality of gateway stations (i.e., transit stations for connecting a public switched telephone network with mobile subscribers); the mobile user communication terminal comprises a vehicle-mounted terminal, a ship-based terminal, an airborne terminal and a handset. The users are free to move within the coverage area of the satellite beam, and the satellite transmits signals to maintain communication with the terrestrial communication system and the users of the private system or other mobile users.
Compared with other communication modes, the satellite mobile communication has the advantages of large coverage area, long communication distance, flexible communication, stable and reliable line and the like. Therefore, satellite mobile communication has become an important development direction of communication services.
With the rapid development of satellite mobile communication technology, more and more mobile terminals support satellite mobile communication. The satellite mobile communication network also supports the function of TCP/IP link access to the internet, as well as the public land mobile communication network, so that the mobile terminal can perform networking communication through the satellite mobile communication network. However, since the bandwidth of the satellite mobile communication network is narrow, real-time voice intercom cannot be realized by using the instant messaging application, thereby affecting user experience.
Disclosure of Invention
The invention mainly aims to provide a voice talkback method, a voice talkback device and a mobile terminal, and aims to solve the technical problem that the mobile terminal based on satellite mobile communication cannot utilize instant messaging application to realize real-time voice talkback.
To achieve the above object, an embodiment of the present invention provides a voice intercom method, including the following steps:
keeping long connection with a server through a satellite mobile communication network;
compressing the collected voice information by using a low-rate voice coding algorithm and generating a first voice file;
and sending the first voice file to the server so that the server sends the first voice file to an opposite terminal.
Optionally, the step of establishing a connection with a server through a satellite mobile communication network and maintaining a long connection further includes:
acquiring a second voice file sent by an opposite terminal from the server;
and outputting the second voice file.
Optionally, the step of obtaining, from the server, the second voice file sent by the peer end includes:
receiving a download address of the second voice file sent by the server;
and downloading the second voice file according to the download address.
Optionally, the step of outputting the second voice file includes:
judging whether the second voice file is a low-rate voice coding file or not;
and when the file is a low-speed voice coding file, decoding the second voice file by using a low-speed voice decoding algorithm and then playing the second voice file.
Optionally, the low-rate speech coding algorithm is an adaptive multi-rate AMR algorithm, a mixed excitation linear prediction coding (MELP) algorithm, a code excitation linear prediction Coding (CELP) algorithm, a Sinusoidal Transform Coding (STC) algorithm, a time-frequency domain interpolation coding (TFI) algorithm, a pitch synchronous excitation linear prediction coding (PSELP) algorithm, a multi-band excitation coding (MBE) algorithm, or a waveform interpolation coding (WI) algorithm.
Optionally, when the low-rate speech compression coding algorithm is an AMR algorithm, the step of compressing the collected speech information by using the low-rate speech compression coding algorithm and generating the speech file includes:
and carrying out compression coding on the acquired voice information by utilizing the AMR algorithm so as to reduce the code rate of the voice information to a preset value and generate a voice file in an AMR format.
Optionally, the preset value is 6.6 kb/s.
Optionally, when the low-rate speech compression coding algorithm is a MELP algorithm, the step of compressing the collected speech information by using the low-rate speech compression coding algorithm and generating the speech file includes:
and carrying out compression coding on the collected voice information by using the MELP algorithm so as to reduce the code rate of the voice information to 2.4kb/s and generate a voice file in the MELP format.
Optionally, the step of sending the voice file to the server includes:
and transmitting the voice file to the server by adopting a TCP/IP (Transmission control protocol/Internet protocol) sub-packet.
The embodiment of the invention also provides a voice intercom device, which comprises:
the connection module is used for keeping long connection with the server through a satellite mobile communication network;
the processing module is used for compressing the collected voice information by using a low-rate voice coding algorithm and generating a first voice file;
and the sending module is used for sending the first voice file to the server so that the server sends the first voice file to an opposite terminal.
Optionally, the apparatus further comprises:
the acquisition module is used for acquiring a second voice file sent by the opposite terminal from the server;
and the output module is used for outputting the second voice file.
Optionally, the obtaining module includes:
the receiving unit is used for receiving the download address of the second voice file sent by the server;
and the downloading unit is used for downloading the second voice file according to the downloading address.
Optionally, the output module includes:
the judging unit is used for judging whether the second voice file is a low-rate voice coding file or not;
and the playing unit is used for decoding the second voice file by using a low-rate voice decoding algorithm and then playing the second voice file when the second voice file is a low-rate voice coding file.
Optionally, when the low-rate speech compression coding algorithm is an AMR algorithm, the processing module is configured to: and carrying out compression coding on the acquired voice information by utilizing the AMR algorithm so as to reduce the code rate of the voice information to a preset value and generate a voice file in an AMR format.
Optionally, when the low-rate speech compression coding algorithm is a MELP algorithm, the processing module is configured to: and carrying out compression coding on the collected voice information by using the MELP algorithm so as to reduce the code rate of the voice information to 2.4kb/s and generate a voice file in the MELP format.
Optionally, the sending module is configured to: and transmitting the voice file to the server by adopting a TCP/IP (Transmission control protocol/Internet protocol) sub-packet.
The invention also proposes a mobile terminal comprising a memory, a processor and at least one application stored in said memory and configured to be executed by said processor, characterized in that said application is configured for executing the aforementioned voice intercom method.
According to the voice talkback method provided by the embodiment of the invention, the satellite mobile communication network is kept in long connection with the server, and the low-rate voice coding algorithm is utilized to compress the collected voice information, so that the code rate of the voice information is greatly reduced, the capacity of the sent voice file is reduced, the bandwidth resource of the satellite mobile communication network is saved, the low-delay real-time talkback is realized, the technical problem that the mobile terminal based on the satellite mobile communication cannot utilize the instant communication application to realize the real-time voice talkback in the prior art is solved, and the user experience is improved.
Drawings
FIG. 1 is a flow chart of a first embodiment of the voice intercom method of the present invention;
FIG. 2 is a flow chart of a second embodiment of the voice intercom method of the present invention;
FIG. 3 is a block diagram of a first embodiment of the speech communicator of the present invention;
FIG. 4 is a block diagram of a second embodiment of the speech communicator of the present invention;
FIG. 5 is a block diagram of an acquisition module of the voice intercom system of FIG. 4;
fig. 6 is a block diagram of an output module of the voice intercom device of fig. 4.
The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative only and should not be construed as limiting the invention.
As used herein, the singular forms "a", "an", "the" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms "comprises" and/or "comprising," when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element or intervening elements may also be present. Further, "connected" or "coupled" as used herein may include wirelessly connected or wirelessly coupled. As used herein, the term "and/or" includes all or any element and all combinations of one or more of the associated listed items.
It will be understood by those skilled in the art that, unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the prior art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
As will be appreciated by those skilled in the art, "terminal" as used herein includes both devices that are wireless signal receivers, devices that have only wireless signal receivers without transmit capability, and devices that include receive and transmit hardware, devices that have receive and transmit hardware capable of performing two-way communication over a two-way communication link. Such a device may include: a cellular or other communication device having a single line display or a multi-line display or a cellular or other communication device without a multi-line display; PCS (Personal Communications Service), which may combine voice, data processing, facsimile and/or data communication capabilities; a PDA (Personal Digital Assistant), which may include a radio frequency receiver, a pager, internet/intranet access, a web browser, a notepad, a calendar and/or a GPS (Global Positioning System) receiver; a conventional laptop and/or palmtop computer or other device having and/or including a radio frequency receiver. As used herein, a "terminal" or "terminal device" may be portable, transportable, installed in a vehicle (aeronautical, maritime, and/or land-based), or situated and/or configured to operate locally and/or in a distributed fashion at any other location(s) on earth and/or in space. As used herein, a "terminal Device" may also be a communication terminal, a web terminal, a music/video playing terminal, such as a PDA, an MID (Mobile Internet Device) and/or a Mobile phone with music/video playing function, or a smart tv, a set-top box, etc.
As used herein, a server, as will be understood by those skilled in the art, includes, but is not limited to, a computer, a network host, a single network server, a collection of network servers, or a cloud of servers. Here, the Cloud is composed of a large number of computers or network servers based on Cloud Computing (Cloud Computing), which is a kind of distributed Computing, a super virtual computer composed of a group of loosely coupled computer sets. In the embodiment of the present invention, the server, the terminal device and the WNS server may communicate with each other through any communication method, including but not limited to mobile communication based on 3GPP, LTE and WIMAX, computer network communication based on TCP/IP and UDP protocols, and short-range wireless transmission based on bluetooth and infrared transmission standards.
Referring to fig. 1, a first embodiment of the speech intercom method of the present invention is presented, said method comprising the steps of:
and S11, keeping long connection with the server through the satellite mobile communication network.
In step S11, after the mobile terminal establishes a connection with the server through the satellite mobile communication network, the mobile terminal maintains a long connection with the server at a certain heartbeat cycle, that is, the mobile terminal sends a heartbeat packet to the server every other heartbeat cycle, so as to maintain a connection path between the two, thereby implementing low-delay real-time transmission of subsequent voice packets.
Alternatively, the mobile terminal may maintain a long connection with the server at a preset heartbeat cycle.
Optionally, the mobile terminal may perform adaptive adjustment on the heartbeat cycle according to the reference heartbeat cycle and the signal quality of the satellite mobile communication network, obtain an adaptive heartbeat cycle which is relatively long and can maintain stable connection, and maintain long connection with the server according to the adaptive heartbeat cycle. The reference heartbeat period may be a preset heartbeat period, a heartbeat period used in the last connection, a heartbeat period used by other mobile communication networks (such as a public land mobile communication network), and the like.
For example, the mobile terminal first performs a long connection test with a reference heartbeat period. When the long connection can be maintained in the reference heartbeat period, the duration is increased on the basis of the reference heartbeat period to perform the long connection test, and the self-adaptive heartbeat period capable of maintaining the long connection is obtained, for example: gradually increasing the duration on the basis of the reference heartbeat period to perform a long connection test until the long connection can not be maintained; when the long connection cannot be maintained, the heartbeat cycle of the previous test is selected as the self-adaptive heartbeat cycle. When the long connection cannot be maintained with reference to the heartbeat cycle, the duration is reduced on the basis of the reference heartbeat cycle to perform the long connection test, and the adaptive heartbeat cycle capable of maintaining the long connection is obtained, for example: gradually reducing the duration on the basis of the reference heartbeat period to perform a long connection test until the long connection can be maintained; and when the long connection can be maintained, selecting the heartbeat cycle of the test as the self-adaptive heartbeat cycle. Finally, the mobile terminal maintains a long connection with the server with a self-adaptive heartbeat cycle.
In the embodiment of the invention, the mobile terminal can be a satellite mobile communication terminal only supporting satellite mobile communication, or a convergence terminal of satellite mobile communication and public land mobile communication supporting both satellite mobile communication and public land mobile communication.
And S12, processing the collected voice information by using a low-rate voice coding algorithm and generating a first voice file.
In the embodiment of the invention, the mobile terminal can carry out voice talkback with other terminals through instant messaging applications such as WeChat, easy-to-believe, QQ and the like, and the other terminals are opposite terminals of the mobile terminal. When the mobile terminal sends a voice file to the opposite terminal, the mobile terminal is a sending terminal, and the opposite terminal is a receiving terminal; when the mobile terminal receives the voice file sent by the opposite terminal, the mobile terminal is a receiving terminal, and the opposite terminal is a sending terminal.
When the mobile terminal serves as a sending end, voice information is collected through a microphone, the collected voice information is processed through a low-rate voice coding algorithm, and a first voice file is generated.
Optionally, when the mobile terminal collects the voice information, the application processor collects the voice information by using the 8-bit ADC and the 8k sampling frequency, and performs digital recording on the collected voice information. The code rate of the collected voice information is 64 kb/s.
The low-rate speech coding algorithm may be any one of speech coding algorithms such as an adaptive multi-rate (AMR) algorithm, a mixed excitation linear prediction coding (MELP) algorithm, a code excited linear prediction Coding (CELP) algorithm, a Sinusoidal Transform Coding (STC) algorithm, a time-frequency domain interpolation coding (TFI) algorithm, a pitch synchronous excitation linear prediction coding (PSELP) algorithm, a multi-band excitation coding (MBE) algorithm, a waveform interpolation coding (WI) algorithm, and the like.
For example, taking the AMR algorithm as an example, the mobile terminal performs compression coding on the collected voice information by using the AMR algorithm to reduce the code rate of the voice information to a preset value, and generates a voice file in the AMR format. AMR can adopt nine codes from 6.6kb/s to 23.85kb/s, and the lowest code rate of 6.6kb/s is preferred to be a preset value. Therefore, the code rate of the voice information is greatly reduced, the capacity of the voice file is reduced, the bandwidth resource of the satellite mobile communication network is saved, and the low-delay real-time talkback is realized.
For another example, taking the MELP algorithm as an example, the mobile terminal performs compression coding on the collected voice information by using the MELP algorithm, so as to reduce the code rate of the voice information to 2.4kb/s, and generate a voice file in the MELP format. Therefore, the code rate of the voice information is greatly reduced, the capacity of the voice file is reduced, the bandwidth resource of the satellite mobile communication network is saved, and the low-delay real-time talkback is realized.
And S13, sending the first voice file to the server so that the server sends the first voice file to the opposite terminal.
In the embodiment of the invention, a satellite communication modem (modem) of the mobile terminal establishes communication with the server through a socket (socket), and the mobile terminal preferably adopts a transmission control protocol/internet protocol (TCP/IP) protocol sub-packet to transmit the voice file to the server. That is, the mobile terminal divides the first voice file into a plurality of voice packets, and sequentially transmits the plurality of voice packets to the server in order. After receiving a plurality of voice packets, the server stores the voice packets into the cache in sequence according to the starting identifier and the ending identifier of the voice packets to form a voice file, namely, the first voice file is restored.
Each voice packet is a TCP/IP protocol packet, and the TCP/IP protocol packet comprises the following components:
-a pack head-a pack length-a pack body
The mobile terminal and the server may agree on the definition of the packet header (for example, set different identifiers), and the server analyzes the packet header of the TCP/IP protocol packet to distinguish whether the network transmitting the voice file is a satellite mobile communication network or a public land mobile communication network, that is, whether the voice file sent by the sending end is a low-rate voice coding file or a common voice coding file.
The server can adopt a software architecture supporting concurrent access of a plurality of clients, such as MINA, Erlang and the like, and supports multi-user high-concurrency access to the server. For example, a multi-threaded mechanism is employed, with one thread for listening to client requests and multiple threads for handling multiple user concurrent requests.
The specific process of the server side is as follows: creating server end object to generate monitoring thread, starting port monitoring, starting receiving client end connection request, creating client end object to generate new thread when client end connection comes. Sending data to a client, creating a data stream transmission object, starting data interception, and judging the data length when receiving the data. When the data length is 0, judging that the connection is disconnected, and deleting the client object and the useless thread; when the data length is not 0, the data is processed.
After receiving the first voice file, the server may send the voice file to the receiving end in the following two ways: one is that a download address of the first voice file is sent to the receiving end, so that the receiving end can directly download the first voice file according to the download address; and the other method is to adopt TCP/IP protocol to transmit the first voice file to the receiving end by sub-package.
If the receiving end is accessed to the satellite mobile communication network, the downloading mode is preferentially adopted, so that the time delay can be reduced. If the receiving end is accessed to the public land mobile communication network, the two modes can be both.
According to the voice talkback method, the satellite mobile communication network is in long connection with the server, and the low-rate voice coding algorithm is used for compressing the collected voice information, so that the code rate of the voice information is greatly reduced, the capacity of the sent voice file is reduced, the bandwidth resource of the satellite mobile communication network is saved, the low-delay real-time talkback is realized, the technical problem that a mobile terminal based on satellite mobile communication cannot realize real-time voice talkback by using instant communication application in the prior art is solved, and the user experience is improved.
Further, as shown in fig. 2, in the second embodiment of the voice intercom method of the present invention, when the mobile terminal is used as a receiving end, step S11 is followed by:
and S14, acquiring the second voice file sent by the opposite terminal from the server.
In the embodiment of the invention, the server preferably sends the download address of the second voice file to the mobile terminal, and the mobile terminal receives the download address sent by the server and downloads the second voice file according to the download address. The second voice file is acquired in a downloading mode, so that the time delay of voice talkback can be reduced, and the user experience is improved.
In other embodiments, the server may also use TCP/IP protocol to packet-transmit the second voice file to the mobile terminal, that is, the server divides the second voice file into a plurality of voice packets, and sequentially transmits the plurality of voice packets to the mobile terminal. After receiving a plurality of voice packets, the mobile terminal stores the voice packets into the cache in sequence according to the starting identifier and the ending identifier of the voice packets to form a voice file, namely, a second voice file is restored.
And S15, outputting the second voice file.
In the embodiment of the invention, after receiving the second voice file, the mobile terminal firstly judges whether the second voice file is a low-rate voice coding file; when the file is a low-speed voice coding file, decoding the second voice file by a low-speed voice decoder by using a low-speed voice decoding algorithm and then playing the second voice file; when the file is a common voice coding file, the second voice file is decoded by a broadband voice decoder and then played.
The mobile terminal may determine whether the second voice file is a low-rate voice encoded file according to the identification information of the second voice file, where the identification information may be set in the header of the voice packet of the second voice file.
For example, when the identification information of the second voice file is the first identification, the second voice file is judged to be a low-rate voice coding file; and when the identification information of the second voice file is the second identification, judging that the second voice file is a common voice coding file.
For another example, when the identification information of the second voice file is the first identification, the second voice file is determined to be a low-rate voice coding file; when the identification information of the second voice file is empty (i.e. no identification), the second voice file is determined to be a normal voice coding file. Or vice versa.
According to the voice talkback method, the second voice file is obtained in a downloading mode, so that the time delay of voice talkback is reduced, and the user experience is improved. And decoding the second voice file by using a low-rate voice decoding algorithm, so that voice talkback with a mobile terminal accessed to the satellite mobile communication network is realized.
In the embodiment of the invention, the mobile terminal accessed to the satellite mobile communication network can carry out voice talkback with other mobile terminals accessed to the satellite mobile communication network and can also carry out voice talkback with other mobile terminals accessed to the public land mobile communication network.
Referring to fig. 3, a first embodiment of the voice intercom apparatus of the present invention is proposed, the apparatus is applied to a mobile terminal, and may be applied to other terminal devices, of course, the apparatus includes aconnection module 10, aprocessing module 20 and a sendingmodule 30, wherein:
the connection module 10: for maintaining a long connection with a server through a satellite mobile communication network.
In the embodiment of the present invention, after theconnection module 10 establishes a connection with the server through the satellite mobile communication network, the connection module maintains a long connection with the server at a certain heartbeat cycle, that is, theconnection module 10 sends a heartbeat packet to the server every other heartbeat cycle, so as to maintain a connection path between the two, thereby implementing low-delay real-time transmission of subsequent voice packets.
Alternatively, theconnection module 10 may maintain a long connection with the server at a preset heartbeat cycle.
Optionally, theconnection module 10 may perform adaptive adjustment on the heartbeat cycle according to the reference heartbeat cycle and the signal quality of the satellite mobile communication network, obtain an adaptive heartbeat cycle with a relatively large cycle and capable of maintaining stable connection, and maintain long connection with the server according to the adaptive heartbeat cycle. The reference heartbeat period may be a preset heartbeat period, a heartbeat period used in the last connection, a heartbeat period used by other mobile communication networks (such as a public land mobile communication network), and the like.
For example, theconnection module 10 first performs a long connection test with reference to the heartbeat cycle. When the long connection can be maintained in the reference heartbeat period, the duration is increased on the basis of the reference heartbeat period to perform the long connection test, and the self-adaptive heartbeat period capable of maintaining the long connection is obtained, for example: gradually increasing the duration on the basis of the reference heartbeat period to perform a long connection test until the long connection can not be maintained; when the long connection cannot be maintained, the heartbeat cycle of the previous test is selected as the self-adaptive heartbeat cycle. When the long connection cannot be maintained with reference to the heartbeat cycle, the duration is reduced on the basis of the reference heartbeat cycle to perform the long connection test, and the adaptive heartbeat cycle capable of maintaining the long connection is obtained, for example: gradually reducing the duration on the basis of the reference heartbeat period to perform a long connection test until the long connection can be maintained; and when the long connection can be maintained, selecting the heartbeat cycle of the test as the self-adaptive heartbeat cycle. Finally, theconnection module 10 maintains a long connection with the server with an adaptive heartbeat cycle.
In the embodiment of the invention, the mobile terminal can be a satellite mobile communication terminal only supporting satellite mobile communication, or a convergence terminal of satellite mobile communication and public land mobile communication supporting both satellite mobile communication and public land mobile communication.
The processing module 20: the voice processing device is used for compressing the collected voice information by using a low-rate voice coding algorithm and generating a first voice file.
In the embodiment of the invention, the voice intercom device can perform voice intercom with other terminals through instant messaging applications such as WeChat, easy-to-communicate, QQ and the like, and the other terminals are opposite terminals of the mobile terminal. When the mobile terminal sends a voice file to the opposite terminal, the mobile terminal is a sending terminal, and the opposite terminal is a receiving terminal; when the mobile terminal receives the voice file sent by the opposite terminal, the mobile terminal is a receiving terminal, and the opposite terminal is a sending terminal.
When the mobile terminal is used as a transmitting end, theprocessing module 20 collects voice information through a microphone, and processes the collected voice information by using a low-rate voice coding algorithm to generate a first voice file.
Optionally, when theprocessing module 20 collects the voice information, the application processor collects the voice information by using 8-bit ADCs and 8k sampling frequencies, and performs digital recording on the collected voice information. The code rate of the collected voice information is 64 kb/s.
The low-rate speech coding algorithm may be any one of speech coding algorithms such as an adaptive multi-rate (AMR) algorithm, a mixed excitation linear prediction coding (MELP) algorithm, a code excited linear prediction Coding (CELP) algorithm, a Sinusoidal Transform Coding (STC) algorithm, a time-frequency domain interpolation coding (TFI) algorithm, a pitch synchronous excitation linear prediction coding (PSELP) algorithm, a multi-band excitation coding (MBE) algorithm, a waveform interpolation coding (WI) algorithm, and the like.
For example, taking the AMR algorithm as an example, theprocessing module 20 performs compression coding on the collected voice information by using the AMR algorithm to reduce the code rate of the voice information to a preset value, and generates a voice file in the AMR format. AMR can adopt nine codes from 6.6kb/s to 23.85kb/s, and the lowest code rate of 6.6kb/s is preferred to be a preset value. Therefore, the code rate of the voice information is greatly reduced, the capacity of the voice file is reduced, the bandwidth resource of the satellite mobile communication network is saved, and the low-delay real-time talkback is realized.
For another example, taking the MELP algorithm as an example, theprocessing module 20 performs compression coding on the collected voice information by using the MELP algorithm, so as to reduce the code rate of the voice information to 2.4kb/s, and generate a voice file in the MELP format. Therefore, the code rate of the voice information is greatly reduced, the capacity of the voice file is reduced, the bandwidth resource of the satellite mobile communication network is saved, and the low-delay real-time talkback is realized.
The sending module 30: the server is used for sending the first voice file to the server so as to enable the server to send the first voice file to the opposite terminal.
In the embodiment of the present invention, the sendingmodule 30 preferably uses TCP/IP protocol to transmit the voice file to the server by packetization. That is, the transmittingmodule 30 divides the first voice file into a plurality of voice packets, and sequentially transmits the plurality of voice packets to the server in order. After receiving a plurality of voice packets, the server stores the voice packets into the cache in sequence according to the starting identifier and the ending identifier of the voice packets to form a voice file, namely, the first voice file is restored.
Each voice packet is a TCP/IP protocol packet, and the TCP/IP protocol packet comprises the following components:
-a pack head-a pack length-a pack body
The sendingmodule 30 and the server may agree on the definition of the packet header (for example, set different identifiers), and the server analyzes the packet header of the TCP/IP protocol packet to distinguish whether the network transmitting the voice file is a satellite mobile communication network or a public land mobile communication network, that is, to distinguish whether the voice file sent by the sending end is a low-rate voice coding file or a normal voice coding file.
The voice intercom device provided by the embodiment of the invention keeps long connection with the server through the satellite mobile communication network, and compresses the acquired voice information by using the low-rate voice coding algorithm, so that the code rate of the voice information is greatly reduced, the capacity of a sent voice file is reduced, the bandwidth resource of the satellite mobile communication network is saved, the low-delay real-time intercom is realized, the technical problem that a mobile terminal based on satellite mobile communication cannot realize the real-time voice intercom by using the instant communication application in the prior art is solved, and the user experience is improved.
Further, as shown in fig. 4, in the second embodiment of the voice intercom device of the present invention, the device further includes an obtainingmodule 40 and anoutputting module 50, where the obtainingmodule 40 is configured to obtain the second voice file sent by the opposite end from the server, and theoutputting module 50 is configured to output the second voice file.
In the embodiment of the present invention, the server preferably sends the download address of the second voice file to the mobile terminal. At this time, the obtainingmodule 40 includes, as shown in fig. 5, a receivingunit 41 and a downloadingunit 42, where the receivingunit 41 is configured to receive a downloading address of the second voice file sent by the server, and the downloadingunit 42 is configured to download the second voice file according to the downloading address.
In other embodiments, the server may also use TCP/IP protocol to packet-transmit the second voice file to the mobile terminal, that is, the server divides the second voice file into a plurality of voice packets, and sequentially transmits the plurality of voice packets to the mobile terminal. The obtainingmodule 40 receives a plurality of voice packets, and sequentially stores the voice packets in the buffer memory according to the start identifier and the end identifier of the voice packets to form a voice file, i.e. a second voice file is restored.
As shown in fig. 6, theoutput module 50 includes a judgingunit 51 and aplaying unit 52, wherein: the judgingunit 51 is configured to judge whether the second voice file is a low-rate voice encoding file; the playingunit 52 is configured to decode the second voice file through the low-rate voice decoder by using a low-rate voice decoding algorithm and then play the second voice file when the second voice file is a low-rate voice encoded file; and when the second voice file is the common voice coding file, the second voice file is decoded by a broadband voice decoder and then played.
The determiningunit 51 may determine whether the second voice file is a low-rate voice encoded file by using identification information of the second voice file, where the identification information may be set in a header of a voice packet of the second voice file.
For example, when the identification information of the second voice file is the first identification, the judgingunit 51 judges that the second voice file is a low-rate voice encoding file; when the identification information of the second voice file is the second identification, thejudgment unit 51 judges that the second voice file is the normal voice encoding file.
For another example, when the identification information of the second voice file is the first identification, the determiningunit 51 determines that the second voice file is a low-rate voice encoded file; when the identification information of the second voice file is empty (i.e., no identification), thejudgment unit 51 judges that the second voice file is a normal voice encoded file. Or vice versa.
The voice talkback device of the embodiment acquires the second voice file in a downloading mode, reduces the time delay of voice talkback and improves the user experience. And decoding the second voice file by using a low-rate voice decoding algorithm, so that voice talkback with a mobile terminal accessed to the satellite mobile communication network is realized.
The invention also proposes a mobile terminal comprising a memory, a processor and at least one application stored in the memory and configured to be executed by the processor, the application being configured to perform a voice intercom method. The voice intercom method comprises the following steps: keeping long connection with a server through a satellite mobile communication network; compressing the collected voice information by using a low-rate voice coding algorithm and generating a first voice file; and sending the first voice file to the server so that the server sends the first voice file to a receiving end. The voice intercom method described in this embodiment is the voice intercom method related to the above embodiment of the present invention, and is not described herein again.
Those skilled in the art will appreciate that the present invention includes apparatus directed to performing one or more of the operations described in the present application. These devices may be specially designed and manufactured for the required purposes, or they may comprise known devices in general-purpose computers. These devices have stored therein computer programs that are selectively activated or reconfigured. Such a computer program may be stored in a device (e.g., computer) readable medium, including, but not limited to, any type of disk including floppy disks, hard disks, optical disks, CD-ROMs, and magnetic-optical disks, ROMs (Read-Only memories), RAMs (random access memories), EPROMs (Erasable Programmable Read-Only memories), EEPROMs (Electrically Erasable Programmable Read-Only memories), flash memories, magnetic cards, or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a bus. That is, a readable medium includes any medium that stores or transmits information in a form readable by a device (e.g., a computer).
It will be understood by those within the art that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by computer program instructions. Those skilled in the art will appreciate that the computer program instructions may be implemented by a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, implement the features specified in the block or blocks of the block diagrams and/or flowchart illustrations of the present disclosure.
Those of skill in the art will appreciate that various operations, methods, steps in the processes, acts, or solutions discussed in the present application may be alternated, modified, combined, or deleted. Further, various operations, methods, steps in the flows, which have been discussed in the present application, may be interchanged, modified, rearranged, decomposed, combined, or eliminated. Further, steps, measures, schemes in the various operations, methods, procedures disclosed in the prior art and the present invention can also be alternated, changed, rearranged, decomposed, combined, or deleted.
The above description is only a preferred embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications of equivalent structures and equivalent processes, which are made by using the contents of the present specification and the accompanying drawings, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims (10)

CN201710439142.1A2017-06-122017-06-12Voice talkback method and device and mobile terminalActiveCN107276659B (en)

Priority Applications (2)

Application NumberPriority DateFiling DateTitle
CN201710439142.1ACN107276659B (en)2017-06-122017-06-12Voice talkback method and device and mobile terminal
PCT/CN2017/109187WO2018227854A1 (en)2017-06-122017-11-02Speech talkback method and apparatus, and mobile terminal

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201710439142.1ACN107276659B (en)2017-06-122017-06-12Voice talkback method and device and mobile terminal

Publications (2)

Publication NumberPublication Date
CN107276659A CN107276659A (en)2017-10-20
CN107276659Btrue CN107276659B (en)2020-10-09

Family

ID=60067421

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201710439142.1AActiveCN107276659B (en)2017-06-122017-06-12Voice talkback method and device and mobile terminal

Country Status (2)

CountryLink
CN (1)CN107276659B (en)
WO (1)WO2018227854A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN107276659B (en)*2017-06-122020-10-09深圳市沃特沃德股份有限公司Voice talkback method and device and mobile terminal
CN109462541A (en)*2018-10-152019-03-12世讯卫星技术有限公司A kind of satellite-based instantaneous communication system and method
CN109769214A (en)*2018-12-262019-05-17彩讯科技股份有限公司A kind of information push method, device, terminal and medium
CN109818832A (en)*2019-03-072019-05-28南京文卓星辉科技有限公司Public network talk back equipment, the heartbeat processing method of public network talk back equipment and medium
CN111161738A (en)*2019-12-272020-05-15苏州欧孚网络科技股份有限公司Voice file retrieval system and retrieval method thereof
CN112767936B (en)*2021-01-062024-07-12镁佳(北京)科技有限公司Voice dialogue method and device, storage medium and electronic equipment
CN114679215B (en)*2022-05-302022-09-16中移(上海)信息通信科技有限公司Communication method, device and satellite convergence gateway
CN117835358B (en)*2024-03-062024-05-03中国电子科技集团公司第五十四研究所 A voice rate adaptive method for satellite mobile communication system
CN118869666B (en)*2024-09-202025-01-03中国电信股份有限公司Voice communication method, device, electronic equipment, medium and program product

Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN201655293U (en)*2009-11-302010-11-24成都林海电子有限责任公司Audio codec integrated circuit of satellite mobile communication voice coder
CN102769490A (en)*2011-05-052012-11-07中国科学院国家天文台 A Very Low Rate Satellite Mobile Voice Communication System
CN106374991A (en)*2016-08-292017-02-01北京华力创通科技股份有限公司Method, device and system for performing data transmission by utilizing Beidou satellite

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US8346544B2 (en)*2006-01-202013-01-01Qualcomm IncorporatedSelection of encoding modes and/or encoding rates for speech compression with closed loop re-decision
CN105118513B (en)*2015-07-222018-12-28重庆邮电大学A kind of 1.2kb/s low bit rate speech coding method based on mixed excitation linear prediction MELP
CN105577557B (en)*2015-12-182018-10-09中南大学Concurrent TCP connection multiplexing method in a kind of satellite network
CN106788679A (en)*2016-12-212017-05-31福建星海通信科技有限公司A kind of Beidou satellite communication device and method with speech identifying function
CN107276659B (en)*2017-06-122020-10-09深圳市沃特沃德股份有限公司Voice talkback method and device and mobile terminal

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN201655293U (en)*2009-11-302010-11-24成都林海电子有限责任公司Audio codec integrated circuit of satellite mobile communication voice coder
CN102769490A (en)*2011-05-052012-11-07中国科学院国家天文台 A Very Low Rate Satellite Mobile Voice Communication System
CN106374991A (en)*2016-08-292017-02-01北京华力创通科技股份有限公司Method, device and system for performing data transmission by utilizing Beidou satellite

Also Published As

Publication numberPublication date
CN107276659A (en)2017-10-20
WO2018227854A1 (en)2018-12-20

Similar Documents

PublicationPublication DateTitle
CN107276659B (en)Voice talkback method and device and mobile terminal
US11227612B2 (en)Audio frame loss and recovery with redundant frames
US7804819B2 (en)Method for implementing voice over IP through an electronic device connected to a packed switched network
JP5746392B2 (en) System and method for transmitting content from a mobile device to a wireless display
EP2320652B1 (en)Method and apparatus for communicating compressed video information
CN101490992B (en)Encoder initialization and communications
US20050165911A1 (en)Multimedia distributing and/or playing systems and methods using separate resolution-enhancing supplemental data
CN107196908A (en)adaptive streaming multimedia
CN102474396B (en) Efficient error correction scheme for data transmission in wireless in-band signaling systems
CN102098553B (en)Video transmission method and system as well as video server
US20120303833A1 (en)Methods for transmitting and receiving a digital signal, transmitter and receiver
WO2014105383A1 (en)Method and system for adaptive video transmission
US20110116583A1 (en)System and method for encoding and decoding information signals using a priori information
US20250090963A1 (en)Efficient packet-loss protected data encoding and/or decoding
CN113873486B (en)Bluetooth communication method, communication equipment and system
WO2014050546A1 (en)Method for transmitting audio information and packet communication system
US6707826B1 (en)Method and apparatus for wireless bandwidth efficient multi-way calling
CN102594494A (en)Intelligent terminal-oriented progressive network adaptive transmission method
US20140095154A1 (en)Voice transmitting device, voice transmitting method, voice receiving device, and voice receiving method
JP4182347B2 (en) Image data communication system and image data communication method
CN105491255A (en)Method and system for decreasing voice transmission load
KR101527166B1 (en)Apparatus and method for communicating a heterogeneous data over a voice channel
US20250104723A1 (en)Bundled multi-rate feedback autoencoder
KR101478029B1 (en)Data download and upload method of portable terminals using multi-communication network and portable neighbour terminals
US20150063103A1 (en)Bandwidth-dependent compressor for robust header compression and method of use thereof

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant
TR01Transfer of patent right
TR01Transfer of patent right

Effective date of registration:20220520

Address after:518000 floor 1, building 3, Dexin Chang wisdom Park, No. 23 Heping Road, Qinghua community, Longhua street, Longhua District, Shenzhen, Guangdong

Patentee after:Shenzhen waterward Information Co.,Ltd.

Address before:518000 B, 503, 602, digital city building, garden city, 1079 Shekou Road, Shekou, Nanshan District, Shenzhen, Guangdong.

Patentee before:SHENZHEN WATER WORLD Co.,Ltd.


[8]ページ先頭

©2009-2025 Movatter.jp