Movatterモバイル変換


[0]ホーム

URL:


CN109525844A - A kind of acceleration system and method for multi-channel video encoding and decoding - Google Patents

A kind of acceleration system and method for multi-channel video encoding and decoding
Download PDF

Info

Publication number
CN109525844A
CN109525844ACN201910090918.2ACN201910090918ACN109525844ACN 109525844 ACN109525844 ACN 109525844ACN 201910090918 ACN201910090918 ACN 201910090918ACN 109525844 ACN109525844 ACN 109525844A
Authority
CN
China
Prior art keywords
chip
decoding
video encoding
camera
fpga
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910090918.2A
Other languages
Chinese (zh)
Other versions
CN109525844B (en
Inventor
宗艳艳
贡维
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co LtdfiledCriticalZhengzhou Yunhai Information Technology Co Ltd
Priority to CN201910090918.2ApriorityCriticalpatent/CN109525844B/en
Publication of CN109525844ApublicationCriticalpatent/CN109525844A/en
Application grantedgrantedCritical
Publication of CN109525844BpublicationCriticalpatent/CN109525844B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

Translated fromChinese

本发明提出了一种多路视频编解码的加速系统和方法,该系统包括Camera、Server和加速卡。加速卡包括BMC控制模块、芯片处理模块和电源模块;BMC控制模块用于带外监控管理,芯片处理模块用于进行多路视频编解码,电源模块为BMC控制模块和芯片处理模块供电。其中芯片处理模块包括2、3或4个FPGA芯片,FPGA芯片分别连接两个DDR4内存条、JTAG电路、SPI Flash和EMMC,各个FPGA芯片之间采用Chiplink互联,而且FPGA芯片均连接至Switch。基于该加速系统,还提出了多路视频编解码的加速方法。本发明使用FPGA芯片实现,不使用带有视频编解码功能的CPU,比如intel的E3,配合英伟达的P4卡,在成本上大幅度减少。另外使用2个、3个或者3个FPGA芯片,可以根据实际需要提高视频编解码的速度。

The present invention provides a multi-channel video encoding and decoding acceleration system and method, the system includes a Camera, a Server and an acceleration card. The accelerator card includes a BMC control module, a chip processing module and a power supply module; the BMC control module is used for out-of-band monitoring and management, the chip processing module is used for multi-channel video encoding and decoding, and the power supply module supplies power to the BMC control module and the chip processing module. The chip processing module includes 2, 3 or 4 FPGA chips. The FPGA chips are respectively connected to two DDR4 memory sticks, JTAG circuits, SPI Flash and EMMC. Chiplink is used to interconnect each FPGA chip, and the FPGA chips are all connected to the Switch. Based on the acceleration system, an acceleration method for multi-channel video encoding and decoding is also proposed. The present invention is realized by using FPGA chip, and does not use CPU with video encoding and decoding function, such as Intel's E3, and cooperates with NVIDIA's P4 card, which greatly reduces the cost. In addition, using 2, 3 or 3 FPGA chips can improve the speed of video encoding and decoding according to actual needs.

Description

A kind of acceleration system and method for multi-channel video encoding and decoding
Technical field
The present invention relates to server video encoding and decoding technique fields, specifically provide a kind of acceleration of multi-channel video encoding and decodingSystem and method.
Background technique
In recent years, the life of the continuous development let us of network technology is more and more abundant colorful.There is this convenience of networkCarrier, the progress that multimedia technology is also maked rapid progress, as the core and key of multimedia technology, multimedia videoEncoding and decoding all achieve major progress in technology and application aspect in recent years.The main function of Video coding is by video image prime numberIt is collapsed into video code flow according to RGB or YUV etc., to reduce the data volume of video.
More and more for the application demand of present coding and decoding video, present client is all based on to compile with video and solveThe CPU of code function, such as the E3 of intel, cooperate the tall and handsome P4 card reached.Current E3 integrated codec chip maximum processing capabilityIt is 12 road 1080P.And only have 2 chips to be used for coding and decoding video in P4 card, if being intended only as encoding and decoding single application, P4Other calculated performances possessed by blocking undoubtedly are exactly to waste, and the price of P4 card is more expensive.Based on current many customers' placesSmall video is managed, the multiple business such as live streaming even more occurs in security protection, on the road of encoding and decoding and cost has higher wantIt asks.
Summary of the invention
For disadvantage mentioned above, the embodiment of the present invention proposes the acceleration system and method for a kind of multi-channel video encoding and decoding, dropLow cost, while improving the speed of the encoding and decoding of multi-channel video.
The embodiment of the present invention proposes a kind of acceleration system of multi-channel video encoding and decoding, including Camera, Server and addsSpeed card, the accelerator card includes BMC control module, chip processing module and power module;
The BMC control module includes BMC, house dog, fan, temperature sensor and EEPROM;The BMC passes through UARTIt is connected with house dog;The BMC is connected with fan, temperature sensor and EEPROM respectively by I2C;
The chip processing module includes 2,3 or 4 fpga chips;The fpga chip is separately connected two DDR4 memoriesItem, jtag circuit, SPI Flash and EMMC;It is interconnected between the fpga chip using Chiplink, and the fpga chipSwitch is also connected to by UART;
The power module includes power regulator;The power regulator connects external+12V power supply, while controlling with BMCMolding block is connected with chip processing module;
The power module is BMC control module and chip processing module for power supply;The BMC control module is used for out-of-band supervisionKeyholed back plate reason;The chip processing module is for carrying out multi-channel video encoding and decoding.
Further, the accelerator card further include programmable clock chip, indicator light key, reset key, test point byKey and external interface;
The programmable clock chip is for being kept for synchronous clock, display and record time;
The indicator light key, which is used to indicate, accelerates card failure or in place;
The reset key is used to restart in case of constant power when accelerator card breaks down;
The input and output of pin when the test point key is tested for fpga chip;
The external interface includes USB and HUB.
Further, the fpga chip uses the ZU7EV chip of Xilinx.
Further, the capacity of the DDR4 memory bar is 4GB.
Further, it is interconnected between the fpga chip using Chiplink, is used between the fpga chipThe port specification of Chiplink interconnection is Serdes x4, and the fpga chip walks line rate 10Gbps.
A kind of accelerated method of multi-channel video encoding and decoding is that the acceleration system based on a kind of multi-channel video encoding and decoding is realized,It the described method comprises the following steps:
The H.264/H.265 coded data that S1:Server transmits Camera IP Camera is sent in accelerator cardChip processing module;
S2: the H.264/H.265 coded data that the chip processing module in accelerator card transmits Camera IP CameraIt is averagely allocated to each fpga chip according to code stream, each fpga chip is decoded the bit stream data received, so firstCNN reasoning acceleration is carried out afterwards and retrieval accelerates;
S3: accelerator card will be sent to Server by each processed bit stream data of fpga chip in chip processing moduleIn memory.
Further, step S1 includes:
Server memory is written in the H.264/H.265 coded data that NIC transmits Camera IP Camera in ServerFirst memory headroom of middle App process;
Accelerator card driving is called, and applies for the second required memory headroom of accelerator card driving in accelerator card memory, and willThe H.264/H.265 coded data copy of Camera IP Camera transmission is mapped to the second memory headroom.
Further, step S2 includes:
Server writes the PCIE accelerator card register in the space MMCFG, and FPGA reads Camera IP Camera using DMAThe H.264/H.265 coded data of transmission, by the H.264/H.265 coded data of Camera IP Camera transmission from theTwo spaces are transported to chip processing module, are evenly distributed to each fpga chip according to code stream by Switch;
H.264/H.265, the fpga chip is decoded the bit stream data that comes of distribution, and by decoded dataIt carries out CNN reasoning acceleration and retrieval accelerates.
Further, step S3 includes:
After the fpga chip completes CNN reasoning acceleration and retrieval acceleration, MSI interrupt is initiated to Server, Server writesPCIE accelerator card register in the space MMCFG, FPGA use DMA write operation, after completing CNN reasoning acceleration and retrieval accelerationData copy the second memory headroom to from FPGA;
Server is copied the data that CNN reasoning accelerates and retrieves after accelerating are completed or is mapped to from the second memory headroomFirst memory headroom of App process;
Accelerator card driving, which is called, to be returned.
The effect provided in summary of the invention is only the effect of embodiment, rather than invents all whole effects, above-mentionedA technical solution in technical solution have the following advantages that or the utility model has the advantages that
The embodiment of the present invention proposes a kind of acceleration system of multi-channel video encoding and decoding, the system include Camera,Server and accelerator card.Wherein accelerator card includes BMC control module, chip processing module and power module;BMC control module is usedIn out of band supervision management, chip processing module is BMC control module and chip for carrying out multi-channel video encoding and decoding, power moduleProcessing module power supply.BMC control module includes BMC, house dog, fan, temperature sensor and EEPROM, BMC by UART withHouse dog is connected, and BMC also passes through I2C and is connected respectively with fan, temperature sensor and EEPROM.Chip processing module includes 2,3Or 4 fpga chips, fpga chip is separately connected two DDR4 memory bars, jtag circuit, SPI Flash and EMMC, eachIt is interconnected between fpga chip using Chiplink, and fpga chip is connected to Switch.Power module includes electric power adjustmentDevice, power regulator connects external+12V power supply, while being connected with BMC control module and chip processing module.Based on the multichannelA kind of acceleration system of coding and decoding video, it is also proposed that accelerated method of multi-channel video encoding and decoding.The present invention uses fpga chipIt realizes, without using the CPU for having coding and decoding video function, such as the E3 of intel, cooperates the tall and handsome P4 card reached, it is big in costAmplitude is reduced.In addition 2,3 or 3 fpga chips are used, the speed of coding and decoding video can be improved according to actual needs.Fpga chip uses the ZU7EV chip of Xilinx, and each ZU7EV chip can carry out the coding and decoding video on 8 tunnels, 2 FPGA coresPiece can carry out 16 road coding and decoding videos, and 3 fpga chips can carry out 24 road coding and decoding videos, and 4 fpga chips can be into32 road coding and decoding video of row, improves the speed of coding and decoding video.
Detailed description of the invention
Fig. 1 is the structure connection signal of accelerator card in a kind of acceleration system of the multi-channel video encoding and decoding of the embodiment of the present invention 1Figure;
Fig. 2 is 4 fpga chips of accelerator card in the acceleration system based on a kind of multi-channel video encoding and decoding of the embodiment of the present invention 1Interconnect topological diagram;
Fig. 3 is the overall data process figure of the accelerated method based on a kind of multi-channel video encoding and decoding of the embodiment of the present invention 1.
Specific embodiment
In order to clarify the technical characteristics of the invention, below by specific embodiment, and its attached drawing is combined, to this hairIt is bright to be described in detail.Following disclosure provides many different embodiments or example is used to realize different knots of the inventionStructure.In order to simplify disclosure of the invention, hereinafter the component of specific examples and setting are described.In addition, the present invention can be withRepeat reference numerals and/or letter in different examples.This repetition is that for purposes of simplicity and clarity, itself is not indicatedRelationship between various embodiments and/or setting is discussed.It should be noted that illustrated component is not necessarily to scale in the accompanying drawingsIt draws.Present invention omits the descriptions to known assemblies and treatment technology and process to avoid the present invention is unnecessarily limiting.
Embodiment 1
The embodiment of the present invention 1 provides a kind of acceleration system of multi-channel video encoding and decoding, the system include Camera,Server and accelerator card.
Camera is for providing video data;Server receives video data for connecting, and to accelerator card encoding and decoding numberAccording to carrying out control management.
It is the structure company of accelerator card in a kind of acceleration system of the multi-channel video encoding and decoding of the embodiment of the present invention 1 as shown in Figure 1Connect schematic diagram.Accelerator card includes BMC control module, chip processing module and power module;
BMC control module is used for out of band supervision management, and chip processing module is for carrying out multi-channel video encoding and decoding, power supply mouldBlock is BMC control module and chip processing module for power supply.
BMC control module includes BMC, house dog, fan, temperature sensor and EEPROM, and BMC passes through UART and house dogIt is connected, BMC also passes through I2C and is connected respectively with fan, temperature sensor and EEPROM.
Chip processing module includes 2,3 or 4 fpga chips, and fpga chip is separately connected two DDR4 memory bars, JTAGCircuit, SPI Flash and EMMC.The capacity of two of them DDR4 memory bar is 4GB, band ECC, frequency 2400MH.
It is interconnected between each fpga chip using Chiplink.It is based on a kind of multichannel of the embodiment of the present invention 1 as shown in Figure 24 fpga chips of accelerator card interconnect topological diagram in the acceleration system of coding and decoding video;The port specification of each fpga chipSerdes (GTH) x4, walking line rate is 10Gbps.
Fpga chip is connected to Switch by UART.Accelerator card is communicated by PCIE golden finger with Server,Server server provides the signal of X8, is converted into the signal of 4 X8 to fpga chip by PCIe Switch.
Fpga chip uses the ZU7EV chip of Xilinx, and each ZU7EV chip can carry out the coding and decoding video on 8 tunnels, and 2A fpga chip can carry out 16 road coding and decoding videos, and 3 fpga chips can carry out 24 road coding and decoding videos, 4 FPGA coresPiece can carry out 32 road coding and decoding videos.Using 4 fpga chips as explanation in the embodiment of the present invention 1, the model that the present invention protectsIt encloses and is not limited to the embodiment 1.
Power module includes power regulator, and power regulator connects external+12V power supply, at the same with BMC control module andChip processing module is connected.
Accelerator card further includes programmable clock chip, indicator light key, reset key, test point key and external interface;
Programmable clock chip is for being kept for synchronous clock, display and record time;Programmable clock chip respectively withZU7EV0 chip, ZU7EV1 chip, ZU7EV2 chip are connected with ZU7EV3 chip.
Indicator light key, which is used to indicate, accelerates card failure or in place;
Reset key is used to restart in case of constant power when accelerator card breaks down;
The input and output of pin when test point key is tested for fpga chip;
External interface includes USB and HUB.
Accelerator card further includes high-definition digital display interface Display port, high-definition digital display interface Display portIt is connected with ZU7EV0 chip, ZU7EV1 chip, ZU7EV2 chip and ZU7EV3 chip.
A kind of acceleration system based on a kind of multi-channel video encoding and decoding that the embodiment of the present invention 1 proposes, it is also proposed that multichannelThe accelerated method of coding and decoding video.
Before the accelerated method for executing a kind of multi-channel video encoding and decoding, 32 road camera videos compile solution in Camera firstCode data Camera0, Camera1 ... Camera31 passes through the NIP that Switch is sent to Server.
Then, the 32 road cameras that execution execution step S1:Server transmits Camera IP Camera are H.264/H.265 coded data is sent to the chip processing module in accelerator card;
S2: the 32 road cameras that the chip processing module in accelerator card transmits Camera IP Camera are H.264/H.265 coded data distributes 8 road bit stream datas according to each fpga chip, and each fpga chip is to the 8 road code stream numbers receivedAccording to be decoded first, then carry out CNN reasoning acceleration and retrieval accelerate;
S3: accelerator card will be sent to Server by each processed bit stream data of fpga chip in chip processing moduleIn memory.
It is illustrated in figure 3 the overall data stream of the accelerated method based on a kind of multi-channel video encoding and decoding of the embodiment of the present invention 1Cheng Tu.
The 32 road cameras that NIC transmits Camera IP Camera in process 1:Server H.264/H.265 coded numberAccording to the first memory headroom of App process in write-in Server memory.
Process 2: calling accelerator card driving, and applies for that the second required memory of accelerator card driving is empty in accelerator card memoryBetween, and H.264/H.265 coded data is copied or is mapped in second by 32 road cameras that Camera IP Camera transmitsDeposit space.
Process 3:Server writes the PCIE accelerator card register in the space MMCFG, and FPGA reads Camera network using DMA32 road cameras of thecamera head H.264/H.265 coded data, and the 32 tunnels camera shooting that Camera IP Camera is transmittedH.264/H.265 coded data from second space is transported to chip processing module to head, by Switch according to each fpga chipDistribute 8 road bit stream datas.
H.264/H.265, process 4:FPGA chip is decoded the 8 road bit stream datas that distribution comes.
Decoded data are carried out CNN reasoning acceleration to process 5:FPGA chip and retrieval accelerates.
After process 6:FPGA chip completes CNN reasoning acceleration and retrieval acceleration, MSI interrupt, Server are initiated to ServerThe PCIE accelerator card register in the space MMCFG is write, FPGA uses DMA write operation, will complete CNN reasoning and accelerate and retrieve to accelerateData afterwards copy the second memory headroom to from fpga chip;
Process 7:Server is copied the data that CNN reasoning accelerates and retrieves after accelerating are completed or is reflected from the second memory headroomIt is mapped to the first memory headroom of App process.
Process 8: accelerator card driving, which is called, to be returned.
Although specification and drawings and examples have been carried out detailed description to the invention, this field skillArt personnel should be appreciated that and still can be modified or replaced equivalently to the invention;And all do not depart from wound of the present inventionThe technical solution and its improvement for the spirit and scope made, are encompassed by the protection scope of the invention patent.

Claims (9)

CN201910090918.2A2019-01-302019-01-30 A system and method for accelerating multi-channel video encoding and decodingActiveCN109525844B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201910090918.2ACN109525844B (en)2019-01-302019-01-30 A system and method for accelerating multi-channel video encoding and decoding

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201910090918.2ACN109525844B (en)2019-01-302019-01-30 A system and method for accelerating multi-channel video encoding and decoding

Publications (2)

Publication NumberPublication Date
CN109525844Atrue CN109525844A (en)2019-03-26
CN109525844B CN109525844B (en)2021-07-27

Family

ID=65799779

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201910090918.2AActiveCN109525844B (en)2019-01-302019-01-30 A system and method for accelerating multi-channel video encoding and decoding

Country Status (1)

CountryLink
CN (1)CN109525844B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN111131873A (en)*2019-12-262020-05-08曙光网络科技有限公司Server audio and video data processing method and processing device
CN112672166A (en)*2020-12-242021-04-16北京睿芯高通量科技有限公司Multi-code stream decoding acceleration system and method of video decoder
CN112686901A (en)*2021-03-112021-04-20北京小白世纪网络科技有限公司US-CT image segmentation method and device based on deep neural network

Citations (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20020072047A1 (en)*1999-12-132002-06-13Michelson Daniel R.System and method for generating composite video images for karaoke applications
EP1797721A1 (en)*2004-09-082007-06-20Inlet Technologies, Inc.Slab-based processing engine for motion video
CN101729879A (en)*2009-12-152010-06-09山东大学Method for realizing real-time video transmission based on MIMO-OFDM system
US20140219361A1 (en)*2013-02-012014-08-07Samplify Systems, Inc.Image data encoding for access by raster and by macroblock
CN106210725A (en)*2016-08-032016-12-07广东技术师范学院A kind of video image compression perception coder/decoder system and decoding method thereof
CN206807656U (en)*2017-06-122017-12-26普讯科技(大连)有限公司 Audio Video Signal Processing Unit
CN208316891U (en)*2018-07-032019-01-01都邑科技(北京)有限公司A kind of image processing system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20020072047A1 (en)*1999-12-132002-06-13Michelson Daniel R.System and method for generating composite video images for karaoke applications
EP1797721A1 (en)*2004-09-082007-06-20Inlet Technologies, Inc.Slab-based processing engine for motion video
CN101729879A (en)*2009-12-152010-06-09山东大学Method for realizing real-time video transmission based on MIMO-OFDM system
US20140219361A1 (en)*2013-02-012014-08-07Samplify Systems, Inc.Image data encoding for access by raster and by macroblock
CN106210725A (en)*2016-08-032016-12-07广东技术师范学院A kind of video image compression perception coder/decoder system and decoding method thereof
CN206807656U (en)*2017-06-122017-12-26普讯科技(大连)有限公司 Audio Video Signal Processing Unit
CN208316891U (en)*2018-07-032019-01-01都邑科技(北京)有限公司A kind of image processing system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
吴明美: "基于FPGA的硬件图像处理技术", 《中国优秀硕士论文电子期刊》*

Cited By (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN111131873A (en)*2019-12-262020-05-08曙光网络科技有限公司Server audio and video data processing method and processing device
CN112672166A (en)*2020-12-242021-04-16北京睿芯高通量科技有限公司Multi-code stream decoding acceleration system and method of video decoder
CN112672166B (en)*2020-12-242023-05-05北京睿芯高通量科技有限公司Multi-code stream decoding acceleration system and method for video decoder
CN112686901A (en)*2021-03-112021-04-20北京小白世纪网络科技有限公司US-CT image segmentation method and device based on deep neural network

Also Published As

Publication numberPublication date
CN109525844B (en)2021-07-27

Similar Documents

PublicationPublication DateTitle
US10067901B2 (en)Methods and apparatuses for providing data received by a state machine engine
CN101833424B (en)High-speed storage and transmission device based on FPGA
CN109525844A (en)A kind of acceleration system and method for multi-channel video encoding and decoding
US8909716B2 (en)Administering truncated receive functions in a parallel messaging interface
CN108268425A (en) Programmable Matrix Processing Engine
CN104954795B (en)A kind of image acquisition transmission system based on JPEG2000
WO2020025006A1 (en)Data compression and decompression method and related apparatus, electronic device, and system
Durante et al.100 Gbps PCI-express readout for the LHCb upgrade
US20190370213A1 (en)Configuring compute nodes in a parallel computer using remote direct memory access ('rdma')
TW201305821A (en)Flexible flash commands
US20090193234A1 (en)Sequencer controlled system and method for controlling timing of operations of functional units
US20220147097A1 (en)Synchronization signal generating circuit, chip and synchronization method and device, based on multi-core architecture
WO2020083019A1 (en)Decoding method based on multi-core processor, terminal device and storage medium
CN105487999A (en)Cascaded large-scale USB expansion device and working method and system
US20140063183A1 (en)Method of processing multi-view image and apparatus for executing the same
CN104394415B (en)A kind of method of video big data distribution decoding
CN105915890A (en)FPGA based SVAC video coding and decoding chip verifying apparatus and method
CN105684035B (en) Group and compress similar photos
CN205566357U (en)Realize data center frame switch ply -yarn drill framework based on CPLD
US20140146067A1 (en)Accessing Configuration and Status Registers for a Configuration Space
CN109213710A (en)High-speed serial interface device and data transmission method thereof
CN107248883B (en) A real-time monitoring system for high-throughput load balancing hardware
JP2013015985A (en)Soc device capable of execution trace dumping of all processor cores of multi-core processor
US20140258748A1 (en)Collective Operation Management In A Parallel Computer
CN204652526U (en)A kind of video acquisition memory circuit based on FPGA

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp