Original author(s) | Margo Seltzer andKeith Bostic ofSleepycat Software |
---|---|
Developer(s) | Sleepycat Software, laterOracle Corporation |
Initial release | 1994; 31 years ago (1994) |
Stable release | |
Written in | C |
Operating system | Windows,Unix-like |
Size | ~1244kB compiled onWindowsx86 |
Type | Embedded database,NoSQL Database |
License | Dual licensed (GNU Affero General Public License andproprietary license |
Website | www |
Berkeley DB (BDB) is anembedded databasesoftwarelibrary forkey/value data, historically significant inopen-source software. Berkeley DB is written inC with API bindings for many otherprogramming languages. BDB stores arbitrary key/data pairs as byte arrays and supports multiple data items for a single key. Berkeley DB is not arelational database,[2] although it has database features includingdatabase transactions,multiversion concurrency control andwrite-ahead logging. BDB runs on a wide variety ofoperating systems, including mostUnix-like andWindows systems, andreal-time operating systems.
BDB was commercially supported and developed bySleepycat Software from 1996 to 2006. Sleepycat Software was acquired byOracle Corporation in February 2006, who continued to develop and sell the C Berkeley DB library. In 2013 Oracle re-licensed BDB under theAGPL license[3][4] and released new versions until May 2020.Bloomberg L.P. continues to develop afork of the 2013 version of BDB within theirComdb2 database, under the originalSleepycat permissive license.
Berkeley DB originated at theUniversity of California, Berkeley as part ofBSD, Berkeley's version of theUnix operating system. After 4.3BSD (1986), the BSD developers attempted to remove or replace all code originating in the originalAT&T Unix from which BSD was derived. In doing so, they needed to rewrite the Unix database package.[5] Seltzer and Yigit[6] created a new database, unencumbered by any AT&T patents: an on-diskhash table that outperformed the existingdbm libraries. Berkeley DB itself was first released in 1991 and later included with 4.4BSD.[5] In 1996Netscape requested that the authors of Berkeley DB improve and extend the library, then at version 1.86, to suit Netscape's requirements for anLDAP server[7] and for use in theNetscape browser. That request led to the creation ofSleepycat Software. This company was acquired byOracle Corporation in February 2006.[8]
Berkeley DB 1.x releases focused on managing key/value data storage and are referred to as "Data Store" (DS). The 2.x releases added a locking system enabling concurrent access to data. This is what is known as "Concurrent Data Store" (CDS). The 3.x releases added a logging system for transactions and recovery, called "Transactional Data Store" (TDS). The 4.x releases added the ability to replicate log records and create a distributed highly available single-master multi-replica database. This is called the "High Availability" (HA) feature set. Berkeley DB's evolution has sometimes led to minor API changes or log format changes, but very rarely have database formats changed. Berkeley DB HA supports online upgrades from one version to the next by maintaining the ability to read and apply the prior release's log records.[citation needed]
Starting with the 6.0.21 (Oracle 12c) release, all Berkeley DB products are licensed under theGNU AGPL.[9][10] Previously, Berkeley DB was redistributed under the 4-clauseBSD license (before version 2.0), and the Sleepycat Public License, which is anOSI-approvedopen-source license as well as anFSF-approvedfree software license.[11][12] The product ships with complete source code, build script, test suite, and documentation. The comprehensive feature along with the licensing terms have led to its use in a multitude offree and open-source software. Those who do not wish to abide by the terms of the GNU AGPL, or use an older version with the Sleepycat Public License, have the option of purchasing anotherproprietary license for redistribution fromOracle Corporation. This technique is calleddual licensing.
Berkeley DB includes compatibility interfaces for some historic Unix database libraries:dbm, ndbm and hsearch (aSystem V andPOSIX library for creating in-memoryhash tables).[13]
Berkeley DB has an architecture notably simpler thanrelational database management systems. LikeSQLite andLMDB, it is not based on aserver/client model, and does not provide support for network access – programs access the database using in-processAPI calls. Oracle added support for SQL in 11g R2 release based on the popular SQLite API by including a version of SQLite in Berkeley DB (it uses Berkeley DB for storage).[14]
A program accessing the database is free to decide how the data is to be stored in a record. Berkeley DB puts no constraints on the record's data. The record and its key can both be up to four gigabytes long.
Berkeley DB supports database features such asACIDtransactions, fine-grainedlocking, hotbackups andreplication.
The name "Berkeley DB" is used by Oracle Corporation for three different products, only one of which is BDB:[15]
BDB was once very widespread, but usage dropped steeply from 2013 (seelicensing section). Notable software that still uses Berkeley DB for data storage include:
Open-source operating systems and languages such asPerl andPython still support old BerkelyDB interfaces. TheFreeBSD andOpenBSD operating systems ship Berkeley DB 1.8x to support thedbopen()
[19][20] operating system call used by password programs such aspwb_mkdb
.[21] Linux operating systems, including those based on Debian,[22] and Fedora[23] ship Berkeley DB 5.3 libraries.
Berkeley DB V2.0 and higher is available under adual license:
Switching the open source license in 2013 from theSleepycat license to the AGPL had a major effect on open source software. Since BDB is a library, any application linking to it must be under an AGPL-compatible license. Many open source applications and all closed source applications would need to be relicensed to become AGPL-compatible, which was not acceptable to many developers and open source operating systems. By 2013 there were many alternatives to BDB, andDebian Linux was typical in their decision to completely phase out Berkeley DB, with a preference for theLightning Memory-Mapped Database (LMDB).[26]