Movatterモバイル変換

[0]ホーム

Jump to content

Distributed data store

Edit links

From Wikipedia, the free encyclopedia

Computer network with multiple nodes to store information

This articleis written like apersonal reflection, personal essay, or argumentative essay that states a Wikipedia editor's personal feelings or presents an original argument about a topic. Pleasehelp improve it by rewriting it in anencyclopedic style.(May 2012) (Learn how and when to remove this message)

Computer memory anddata storage types
General Memory cell Memory coherence Cache coherence Memory hierarchy Memory access pattern Memory map Secondary storage MOS memory floating-gate Continuous availability Areal density (computer storage) Block (data storage) Object storage Direct-attached storage Network-attached storage Storage area network Block-level storage Single-instance storage Data Structured data Unstructured data Big data Metadata Data compression Data corruption Data cleansing Data degradation Data integrity Data security Data validation Data validation and reconciliation Data recovery Storage Data cluster Directory Shared resource File sharing File system Clustered file system Distributed file system Distributed file system for cloud Distributed data store Distributed database Database Data bank Data storage Data store Data deduplication Data structure Data redundancy Replication (computing) Memory refresh Storage record Information repository Knowledge base Computer file Object file File deletion File copying Backup Core dump Hex dump Data communication Information transfer Temporary file Copy protection Digital rights management Volume (computing) Boot sector Master boot record Volume boot record GUID Partition Table Disk array Disk image Disk mirroring Disk aggregation Disk partitioning Memory segmentation Locality of reference Logical disk Storage virtualization Virtual memory Memory-mapped file Software entropy Software rot In-memory database In-memory processing Persistence (computer science) Persistent data structure RAID Non-RAID drive architectures Memory paging Bank switching Grid computing Cloud computing Cloud storage Fog computing Edge computing Dew computing The law Martiels law
Volatile
RAM Hardware cache CPU cache Scratchpad memory DRAM eDRAM SDRAM SGRAM DDR GDDR LPDDR QDRSRAM EDO DRAM XDR DRAM RDRAM HBM SRAM 1T-SRAM ReRAM QRAM Content-addressable memory (CAM) Computational RAM VRAM Dual-ported RAM Video RAM (dual-ported DRAM)
Historical DC3MWCP (1946–1947) Delay-line memory (1947) Mellon optical memory (1951) Selectron tube (1952) Dekatron T-RAM (2009) Z-RAM (2002–2010)
Non-volatile
ROM Diode matrix MROM PROM EPROM EEPROM ROM cartridge Solid-state storage (SSS) Flash memory is used in: Solid-state drive (SSD) Solid-state hybrid drive (SSHD) USB flash drive IBM FlashSystem Flash Core Module Memory card Memory Stick CompactFlash PC Card MultiMediaCard SD card SIM card SmartMedia Universal Flash Storage SxS MicroP2 XQD card Programmable metallization cell
NVRAM Memistor Memristor PCM (3D XPoint) MRAM Electrochemical RAM (ECRAM) Nano-RAM CBRAM
Early-stageNVRAM FeRAM ReRAM FeFET memory
Analog recording Phonograph cylinder Phonograph record Quadruplex videotape Vision Electronic Recording Apparatus Magnetic recording Magnetic storage Magnetic tape Magnetic-tape data storage Tape drive Tape library Digital Data Storage (DDS) Videotape Cassette tape Linear Tape-Open Betamax 8 mm video format DV MiniDV MicroMV U-matic VHS S-VHS VHS-C D-VHS Hard disk drive
Optical 3D optical data storage Optical disc LaserDisc Compact Disc Digital Audio (CDDA) CD CD Video CD-R CD-RW Video CD Super Video CD Mini CD Nintendo optical discs CD-ROM Hyper CD-ROM DVD DVD+R DVD-Video DVD card DVD-RAM MiniDVD HD DVD Blu-ray Ultra HD Blu-ray Holographic Versatile Disc WORM
In development CBRAM Racetrack memory NRAM Millipede memory ECRAM Patterned media Holographic data storage Electronic quantum holography 5D optical data storage DNA digital data storage Universal memory Time crystal Quantum memory UltraRAM
Historical Paper data storage (1725) Punched card (1725) Punched tape (1725) Plugboard Drum memory (1932) Magnetic-core memory (1949) Plated-wire memory (1957) Core rope memory (1960s) Thin-film memory (1962) Disk pack (1962) Twistor memory (~1968) Bubble memory (~1970) Floppy disk (1971)
v t e

Adistributed data store is acomputer network where information is stored on more than onenode, often in areplicated fashion.^[1] It is usually specifically used to refer to either adistributed database where users store information on anumber of nodes, or acomputer network in which users store information on anumber of peer network nodes.^{[citation needed]}

Distributed databases

[edit]

Distributed databases are usuallynon-relational databases that enable a quick access to data over a large number of nodes. Some distributed databases expose rich query abilities while others are limited to akey-value store semantics. Examples of limited distributed databases areGoogle'sBigtable, which is much more than adistributed file system or apeer-to-peer network,^[2]Amazon'sDynamo^[3]andMicrosoft Azure Storage.^[4]

As the ability of arbitrary querying is not as important as theavailability, designers of distributed data stores have increased the latter at an expense of consistency. But the high-speed read/write access results in reduced consistency, as it is not possible to guarantee bothconsistency and availability on a partitioned network, as stated by theCAP theorem.

Peer network node data stores

[edit]

In peer network data stores, the user can usually reciprocate and allow other users to use their computer as a storage node as well. Information may or may not be accessible to other users depending on the design of the network.

Mostpeer-to-peer networks do not have distributed data stores in that the user's data is only available when their node is on the network. However, this distinction is somewhat blurred in a system such asBitTorrent, where it is possible for the originating node to go offline but the content to continue to be served. Still, this is only the case for individual files requested by the redistributors, as contrasted with networks such asHyphanet,Winny,Share andPerfect Dark where any node may be storing any part of the files on the network.

Distributed data stores typically use anerror detection and correction technique.Some distributed data stores (such asParchive over NNTP) useforward error correction techniques to recover the original file when parts of that file are damaged or unavailable.Others try again to download that file from a different mirror.

Examples

[edit]

Distributed non-relational databases

[edit]

Product	License	High availability	Notes
Apache Accumulo	AL2
Aerospike	AGPL
Apache Cassandra	AL2	Yes	formerly used byFacebook
Apache Ignite	AL2
Bigtable	Proprietary		used byGoogle
Couchbase	AL2		used byLinkedIn,PayPal, andeBay
CrateDB	AL2	Yes
Apache Druid	AL2		used byNetflix, andYahoo
Dynamo	Proprietary		used byAmazon
etcd	AL2	Yes
Hazelcast	AL2, Proprietary
HBase	AL2	Yes	formerly used by Facebook
Hypertable	GPL 2		Baidu
MongoDB	SSPL
MySQL NDB Cluster	GPL 2	Yes	SQL and NoSQL APIs
Riak	AL2	Yes
Redis	BSD License	Yes
ScyllaDB	AGPL
Voldemort	AL2		used byLinkedIn

Peer network node data stores

[edit]

BitTorrent
Blockchain (database)
Chord project
Freenet
GNUnet
IPFS
Mnet
Napster
NNTP (the distributed data storage protocol used forUsenet news)
Unity, of the softwarePerfect Dark
Share
Siacoin
DeNet
Storage@home
Tahoe-LAFS
Winny
ZeroNet

References

[edit]

^Yaniv Pessach,Distributed Storage (Distributed Storage: Concepts, Algorithms, and Implementations ed.),OL 25423189M
^"Bigtable: Google's Distributed Data Store". Paper Trail. Archived fromthe original on 2017-07-16. Retrieved2011-04-05.Although GFS provides Google with reliable, scalable distributed file storage, it does not provide any facility for structuring the data contained in the files beyond a hierarchical directory structure and meaningful file names. It's well known that more expressive solutions are required for large data sets. Google's terabytes upon terabytes of data that they retrieve from web crawlers, amongst many other sources, need organising, so that client applications can quickly perform lookups and updates at a finer granularity than the file level. [...] The very first thing you need to know about Bigtable is that it isn't a relational database. This should come as no surprise: one persistent theme through all of these large scale distributed data store papers is that RDBMSs are hard to do with good performance. There is no hard, fixed schema in a Bigtable, no referential integrity between tables (so no foreign keys) and therefore little support for optimised joins.
^Sarah Pidcock (2011-01-31)."Dynamo: Amazon's Highly Available Key-value Store"(PDF). WATERLOO – CHERITON SCHOOL OF COMPUTER SCIENCE. p. 2/22. Retrieved2011-04-05.Dynamo: a highly available and scalable distributed data store
^"Windows Azure Storage".Microsoft. 2011-09-16. Archived fromthe original on 9 November 2011. Retrieved6 November 2011.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Distributed_data_store&oldid=1312403767"

Categories:

Hidden categories:

[8]ページ先頭

Movatterモバイル変換

Distributed databases

Peer network node data stores

Examples

Distributed non-relational databases

Peer network node data stores

See also

References