CROSS-REFERENCE TO RELATED APPLICATIONSThe present application claims priority from U.S. Provisional Patent Application Ser. No. 60/369,890 for “Associating and Linking Album Tags, Table of Contents Data, and Other Compact Disc Data,” filed Apr. 3, 2002, the disclosure of which is incorporated herein by reference.
The present application is related to U.S. patent application Ser. No. 10/167,807 for “Music Information Retrieval,” filed Jun. 11, 2002, the disclosure of which is incorporated herein by reference.
BACKGROUND OF THE INVENTION1. Field of the Invention
The present invention relates generally to playing, copying, and managing music files, such as those found on compact discs (CDs), and more particularly to techniques for retrieving, associating, and linking various sources of metadata for the music files.
2. Description of the Background Art
Most audio CDs contain only digital music and a table of contents that tells the player how many tracks are present, the length of each track, and where on the disc each track starts. In general, audio CDs do not carry “metadata” such as the title of the album, the artist that recorded it, and the names of the songs or tracks contained on it.
Metadata is of value when playing, copying, and managing music files. Metadata includes descriptive information about music tracks and albums, so as to allow users to more easily identify files. For example, if a song title is available as metadata, an audio CD player can look up additional information from a web server, and can display artist name, album title, and song title while a track is being played. The user can view the contents of the CD by track name and select the tracks by name for random access playback. If a user makes a copy of a music file from a CD (a process known as “ripping”), or downloads a music file from an online source, metadata can be used as a default filename. Metadata is also useful in organizing and categorizing a collection of files, for example by artist name or musical genre; metadata can also be used as an identifier for gaining access to additional information about a song, artist, or CD.
Existing music players and CD management (“ripper”) software applications that make use of available metadata include, for example:
- Real Jukebox, available and CD from RealNetworks, Inc. of Seattle, Wash.;
- Winamp, available from Nullsoft, Inc., a division of America Online, Inc., of Dulles, Va.;
- Easy CD Creator 5, available from Roxio, Inc. of Santa Clara, Calif.;
- KDE CD Player for Linux; and
- Toast, available from Roxio, Inc. of Santa Clara, Calif.
Conventionally, since CDs themselves do not generally carry metadata, some music client software applications (including players and CD management software) obtain metadata from external sources. For example, in some software applications users enter metadata manually, and the entered metadata is then associated with some unique characteristics of a CD (such as track lengths), so that the entered metadata can later be accessed whenever the CD is re-inserted in the user's computer. Alternatively, the client software can obtain metadata from a central server that is run by an application service provider (ASP). Such functionality is provided, for example, by CDDB, available from Gracenote of Berkeley, Calif., or by FreeDB, available at “http://www.freedb.org”. The Windows Media Player client, available from Microsoft Corporation of Redmond, Wash., also communicates with a CD Metadata ASP to obtain metadata.
FIG. 1 depicts a system for obtaining metadata from a remote server according to the prior art. An end user of client software102 (such as a music player or ripper application) insertsCD101 in a connected CD drive (not shown).Client software102 requests thatmetadata client module104 identifyCD101 and return the metadata for use bysoftware102.Client module104 may derive a Disc ID from available information fromCD101, such as table of contents (TOC).Module104 then sends the Disc ID over a network connection to a remote metadata ASP103. The request is typically sent over the Internet using Hypertext Transfer Protocol (HTTP) as a transmission protocol.Metadata server105 receives the request and runs a query onmetadata database106 to find a metadata record that matches the Disc ID.Database106 provides the metadata, which is then transmitted back tomodule104.Module104 makes the data available tosoftware102.
Existing systems such as CDDB and FreeDB create the metadata database from input provided by users of the system. If, when a CD is inserted, no matching metadata is found,software102 prompts the user to enter the metadata manually. The user-entered metadata is then transmitted back to Metadata ASP103 and is added todatabase106 and associated with a new disc identifier for the inserted CD.
FIG. 2 depicts this method in more detail, as generally performed bymetadata client104 according to the prior art.Client104extracts201 Disc IDs from the CD TOC data.Client104 then transmits202 the Disc IDs tometadata server105. Ifserver105 indicates203 that matching metadata was found indatabase106, the metadata is made available206 toclient software102. Ifserver105 indicates203 that no match was found,client104 prompts theuser202 to enter the missing metadata. Once the user has provided the metadata,client104 transmits205 the metadata toserver105, which saves the metadata indatabase106. The metadata is made available206 toclient software102.
FIG. 3 depicts a server method of obtaining metadata according to the prior art.Server105 receives arequest301 for metadata fromclient104 over HTTP or some other protocol.Server105 looks up302 the Disc ID inmetadata database106. If a match exists303,server105 returns304 the metadata for the match toclient104, along with status indicating amatch304. Otherwise,server105 returns a status code indicating that no matches were found.
Existing systems do not provide a mechanism for linking metadata from two or more sources. Thus, if metadata is available from some external source, such as a commercially developed third-party database of music information, existing systems are not generally able to integrate such external metadata with the above-described scheme. In general, then, prior art metadata systems do not take advantage of the availability of more accurate and/or more complete information that may be available from a variety of sources. Furthermore, existing systems do not provide a mechanism for establishing links between metadata records in disparate databases, nor do they allow for management of various data sources having different levels of credibility.
SUMMARY OF THE INVENTIONThe present invention includes techniques for enhancing, associating, and linking various sources of metadata for music files, thus allowing integration of commercially generated metadata with user-entered metadata. The invention provides mechanisms for ensuring that metadata provided to the user is of the highest quality and accuracy available, even when the metadata comes from disparate sources having different levels of credibility. Using the techniques of the invention, redundancies among various data sources can be resolved, and inaccuracies can be corrected. The invention therefore enriches the set of album metadata that is available to the ripping and playback features when interacting with an audio CD.
In one embodiment, as described below, the invention utilizes professional quality metadata from a commercial database when such data is available. If such metadata is not available, the invention falls back on user-entered data. As described in more detail below, the invention generates and maintains a linking database allowing records from two or more metadata databases to be linked. The linking database also stores learned relationships among records from disparate databases, so that future queries can be serviced from the data source having the highest degree of credibility.
In one embodiment, the present invention provides improved techniques for identifying approximate matches, when querying metadata databases. In this manner, the present invention is able to detect matches even in the presence of slight variations in TOC data or other identifying data.
The present invention further provides improved techniques for accepting user submissions of metadata, for categorizing user submissions according to relative credibility, and for integrating user submissions with existing metadata.
BRIEF DESCRIPTION OF THE DRAWINGSFIG. 1 is a block diagram depicting a system for obtaining metadata from a remote server according to the prior art.
FIG. 2 is a flowchart depicting a client method for obtaining metadata from a remote server according to the prior art.
FIG. 3 is a flowchart depicting a server method for looking up metadata according to the prior art.
FIG. 4 is a flowchart depicting a server method for adding metadata information to a database, according to one embodiment of the present invention.
FIG. 5 is a block diagram depicting an improved system for linking and associating metadata from two or more sources, according to one embodiment of the present invention.
FIG. 6 is a flowchart depicting an improved server method for vector matching according to one embodiment of the present invention.
FIG. 7 is a flowchart depicting an improved server method for adding metadata information to a database, according to one embodiment of the present invention.
FIG. 8 is a flowchart depicting a method for creating mappings between Disc IDs and CDs, according to one embodiment of the present invention.
FIG. 9 is a block diagram depicting a metadata server according to one embodiment of the present invention.
FIG. 10 is a block diagram depicting an implementation of the present invention in which several separate metadata databases are provided.
FIG. 11 is a block diagram depicting an implementation of the present invention in which a single database contains metadata records having different levels of credibility.
FIG. 12 is a block diagram depicting an example of a database lookup technique of the present invention, wherein at least one database is missing some information.
FIG. 13 is a block diagram depicting a second example of a database lookup technique of the present invention, wherein at least one database is missing some information.
FIG. 14 is a block diagram depicting a third example of a database lookup technique of the present invention, wherein at least one database is missing some information.
FIG. 15 is a block diagram depicting a fourth example of a database lookup technique of the present invention, wherein at least one database is missing some information, and wherein an audio signature is used to formulate a query.
FIG. 16 is a block diagram depicting a detailed system architecture according to one embodiment of the present invention.
FIGS. 17A,17B, and17C are diagrams depicting the software structure of a metadata server according to one embodiment of the present invention.
FIG. 18 is a block diagram depicting the operation of a log consolidator according to one embodiment of the present invention.
FIG. 19 is a flowchart depicting a method of handling metadata submission according to one embodiment of the present invention.
FIGS. 20A and 20B are screen shots showing examples of dialog boxes for an advanced search function according to one embodiment of the present invention.
FIG. 20C is a screen shot showing an example of a dialog box for confirming CD information according to one embodiment of the present invention.
FIG. 20D is a screen shot showing an example of a dialog box for presenting possible matches according to one embodiment of the present invention.
FIG. 21 is a flowchart depicting details of a method of handling metadata submission according to one embodiment of the present invention.
FIG. 22 is a block diagram of system operation in connection audio signatures, according to one embodiment of the invention.
DETAILED DESCRIPTION OF A PREFERRED EMBODIMENTThe following description of system components and operation is merely exemplary of embodiments of the present invention. One skilled in the art will recognize that the various designs, implementations, and techniques described herein may be used alone or in any combination, and that many modifications and equivalent arrangements can be used. Accordingly, the following description is presented for purposes of illustration, and is not intended to limit the invention to the precise forms disclosed.
In particular, in the following description, the present invention is set forth in the context of a client software application, running on a personal computer, for playing music from a CD. The client software application communicates with a server over a network, using established network protocols. One skilled in the art will recognize, however, that the present invention can be implemented in other environments and contexts. For example, the invention may be implemented in a CD player, consumer electronics product, personal digital assistant (PDA), cell phone, or other device. The invention may be implemented independently of playback of a CD. Furthermore, the invention is not limited to operation with CD-based music; the metadata that is retrieved, processed, and updated by the present invention can be descriptive of music, audio, video, multimedia, or any other type of data, and it can be stored on or retrieved from any medium, including without limitation tapes, discs, mini-discs, hard drives, servers, digital versatile discs (DVDs), and the like.
In one embodiment, the system of the present invention utilizes professional quality metadata from a commercial database when such data is available. If such metadata is not available, the invention falls back on user-entered data. In alternative embodiments, any number of databases or metadata sources are made available according to a hierarchical scheme. The retrieval techniques of the present invention select appropriate sources for metadata according to relative credibility, availability, and completeness of each source. Furthermore, in one embodiment, the present invention provides mechanisms for linking records in disparate metadata sources, so that the sources can be integrated to provide more complete data in an efficient manner.
Vector MatchingIn one embodiment, the invention matches TOC-based queries to entries in ametadata database106 using fixed-dimension vectors containing track length information. Referring now toFIG. 6, there is shown a method for vector matching according to one embodiment of the present invention. Exact and/or approximate matching techniques may be used. Exact matching techniques find identical vectors. Approximate matching techniques determine differences between vectors, employing cosine matching (measuring the angle between vectors), difference matching (measuring the magnitude of the difference between vectors), or the like to determine which vectors are most closely aligned with one another.
The system of the invention formulates a vector based on information that can be read from the CD. For example, in one embodiment,client software102 reads TOC data from the CD, andclient module104 formulates a vector from the track lengths derived from the TOC data. In another embodiment,client module104 transmits the TOC data toserver105, which then formulates the vector.
Oncemetadata request601 is received, the system of the invention creates602 a vector using information fromrequest601 such as TOC data. The vector is formulated, for example, by calculating the length of each track in frames. This may be done by subtracting the start offset of the track from the start offset of the following track. For example, in the following excerpt from a TOC-based query, where the first number denotes the number of tracks or segments on the media and the subsequent numbers represent the starting offsets (in media units) of each track or segment:
16+22+150+25046+40960+55582+74952+90015+114823+131009+142311+157817+173768+196309+217677+231408+240533+252548+260 507+274838+292015+305962+322830+339361+4752,
the lengths are 24896, 15914, 14622, 19370, 15063, 24808, 16186, 11302, 15506, 15951, 22541, 21368, 13731, 9125, 12015, 7959, 14331, 17177, 13947, 16868, and 16531.
In one embodiment, the determined lengths are formed into a fixed-dimension vector, such as for example a 16-dimensional vector. Thus, if the disc has more than 16 tracks, any extra track lengths are discarded; if the disc has fewer than 16 tracks, the track lengths are repeated starting at the first length to fill the remaining positions in the vector. The vector for the above query would thus be: [24896 15914 14622 19370 15063 24808 16186 11302 15506 15951 22541 21368 13731 9125 12015 7959]
Had the disc only 10 tracks, the 1stthrough 6thlengths would be repeated in positions 11-16: [24896 15914 14622 19370 15063 24808 16186 11302 15506 15951 24896 15914 14622 19370 15063 24808]
One skilled in the art will recognize that variable-dimension vectors could also be used.
Once the system has created602 a vector, it determines603 whether any records indatabase106 are associated with Disc IDs that exactly match the determined vector. In one embodiment, the system makesdetermination603 using a hash table that maps the TOC vector to the database records, according to hash table techniques that are well known in the art. In one embodiment, if any matches are found, the system returns604 the matching Disc IDs. For example,server105 may transmit the matching Disc IDs toclient module104. In another embodiment,server105 retrieves metadata associated with the matching Disc IDs, and returns the metadata itself toclient module104.
If, in603, no exact match is found, the system looks for approximate matches. In one embodiment, the system performs606 dot product operations (or some other technique of approximate matching) on the determined vector, comparing it with vectors for Disc IDs indatabase106. If the results of any dot product operations exceed apredetermined threshold607, the system returns608 the matching Disc IDs. For example,server105 may transmit the matching Disc IDs toclient module104. In another embodiment, if the results of any dot product operations exceed thethreshold607,server105 retrieves metadata associated with the matching Disc IDs, and returns the metadata itself toclient module104.
If no dot product results exceed the threshold, the system returns610 a “not found” message. In one embodiment, in such an event,client module104 requests the metadata from another source, and/or prompts the user to enter the metadata manually.
In one embodiment, once the method ofFIG. 6 returns one or more Disc IDs instep604 or608, the Disc IDs are transmitted toclient module104.Client module104 then issues one or more “Read” requests to obtain the metadata for the Disc ID(s) it received. The requests may be transmitted to thesame metadata server105 or to some other server (not shown). Accordingly, in one embodiment,server105 looks up Disc IDs but does not serve metadata itself; that task is left to other servers. In an alternative embodiment,server105 looks up Disc IDs and also serves metadata toclient modules104. In yet another embodiment,server105 serves metadata in response to themetadata request601, without performing the intermediary step of returning Disc IDs.
The following are examples of some techniques of approximate matching that can be used in connection with the present invention, for example instep606 ofFIG. 6.
Cosine MatchA cosine or dot-product match is used to find vectors that are similar but not identical.
Given vectors A and B
A·B=|A∥B|cos θ
Where θ is the angle between A and B. When A and B are normalized to unit vectors, the dot product produces the cosine of the angle between them, which is 1.0 for identical vectors, near 1.0 for similar vectors, and near 0.0 for vectors which are not similar.
A cosine match is implemented by forming the vector for the TOC data in a query, normalizing it to unit length, and performing pairwise dot products with all normalized TOC vectors in the database. The maximum dot product indicates the greatest degree of similarity.
Difference MatchA difference match is typically computed on un-normalized vectors. The difference match is computed as:
Σ(Ai−Bi)̂2
or
Σ|Ai−Bi|
Where Ai represents the ith element of vector A, and Bi represents the ith component of vector B and the summation occurs over all dimensions of the vector (i=0 to n−1) for n-dimensional vectors.
Popularity Match OptimizationAn optimization to vector matching is to perform approximate matching on only the most popular CDs first. If a cosine score above a predetermined threshold is obtained, a match is likely and the results can be returned. Otherwise, vector comparisons are performed against the balance of the database.
Total Disc Length Match OptimizationAnother optimization to vector matching is to retrieve the TOC vectors for all discs that have similar total lengths. This constrains the number of vector comparisons that must be performed to produce a match.
Multiple Metadata SourcesIn one embodiment, the present invention provides mechanisms for accessing, integrating, and resolving metadata from two or more sources. For example, metadata may be available from a commercial source, such as for example All Media Guide (AMG) of Ann Arbor, Mich., as well as from a database containing user-entered information. The commercial source may be deemed more reliable than the user-entered database. Thus, in one embodiment, the system retrieves metadata from the commercial source when available, but falls back on the user-entered database when the commercial source is not able to provide the requested metadata.
Any number of data sources may be integrated in this manner. Tiers of relative credibility can be established, so that the system favors those sources that have higher credibility, and only uses lower-credibility sources when the metadata is not available elsewhere. For example, some metadata databases include metadata that has been translated from one language to another. In general, translated metadata is deemed less credible than metadata that is in its original language. Thus, the system of the present invention can be configured to favor non-translated metadata over translated metadata.
As another example, when user-entered metadata has been corroborated by two or more independent sources (for example, if two or more users have entered matching metadata), it is deemed more credible than user-entered metadata that has not been corroborated.
Referring now toFIG. 10, there is shown an example of an implementation of the present invention in which severalseparate metadata databases106 are provided. Eachdatabase106 is associated with a tier designation that denotes the relative credibility of the metadata therein. Whenclient104 requests metadata,server105accesses databases106 in a tiered manner according to the techniques set forth below. For example,metadata databases106 may include a commercial metadata database, corroborated metadata database, and uncorroborated metadata database.
Referring now toFIG. 11, there is shown another embodiment, wherein asingle database106 containsmetadata records1101 having different levels of credibility.Database106 may include metadata records from various sources that have been merged, such as commercial metadata, corroborated user-entered metadata, non-corroborated metadata, and the like. Eachrecord1101 includes atier designation1102 indicating its credibility level.Tier designation1102 can take any form, such as for example a numeric indicator of credibility level. Metadata credibility is thus tracked on a record-by-record basis. Whenclient104 requests metadata,server105 tries to obtain the metadata using higher-tiered records before resorting to lower-tiered records.
One skilled in the art will recognize that the invention can be practiced using any of a number of different types of physical and logical database configurations and architectures, and that the depictions ofFIGS. 10 and 11 are merely exemplary.
Associating and LinkingIn some situations, one ormore metadata databases106 may be missing useful pieces of information. For example, referring now toFIG. 12, there is shown an example whereincommercial database106B containsalbum titles1202,artist information1203, andsong titles1204, but is missing TOC-basedDisc IDs1201 for at least some CDs. Such omissions may occur, for example, if TOC data was not available for some CDs whencommercial database106B was first created. Thus, when a user insertsCD101,server105 is not able to locate a matching record incommercial database106B using information derived from the TOC ofCD101.
The present invention provides several techniques for addressing this problem. One technique is to look for the desired metadata and TOC-based Disc ID another database, such as a user-entereddatabase106A. Often, however,commercial database106B contains higher-quality metadata than does user-entereddatabase106A (as shown inFIG. 12, wheredatabase106A contains several typographical errors and misspellings). Thus, usingdatabase106A entails a sacrifice in quality.
In one embodiment, therefore, the invention employs the technique illustrated by the example ofFIG. 12. Upon receiving a TOC-based query fromclient104, ifcommercial database106B yields no results,server105 issues afirst database query1210 to user-entereddatabase106A using the TOC-based Disc ID received fromclient104. Once thefirst query results1211 are received, data from thoseresults1211 such as album title and/or other unique data) are used to generate asecond database query1212.Second database query1212 may be based, for example, on album title and artist.Second database query1212 is used to look up the higher-quality record incommercial database106B.Results1213 are returned toserver105, which can then provide the results toclient104. In a variation on this embodiment, the entire metadata record or significant portions thereof can be used as the second database query. For example, the album title, artist, and the track list is used asquery1212. Approximate string matching techniques are used in this query to ensure that each track in matching records matches at least approximately. Thus, for example, the track lists for the album titled Abacab indatabase106A and106B would match.
In one embodiment, linkingdatabase509 is also provided. Linkingdatabase509 includes records that link items of information fromdifferent databases106. For example, oncesecond query results1213 are received byserver105,server105 creates a new record indatabase509 including title andTOC data1220. Thus, in the future,server105 can consult linkingdatabase509 to more efficiently find missing pieces of information such as TOC data, without having to perform queries onmultiple databases106A,106B, etc.
Referring now toFIG. 13, there is shown an example whereincommercial database106B is missing TOC-basedDisc IDs1201, and wherein a user-entered album title1301 is used to locate the desired record. In this example, when the TOC-based query yields no results,client104 prompts the user to enter album title information. Alternatively, the user could be asked to provide artist information, or to scan or type in the UPC from the album cover. Based on the user-entered information,server105constructs query1302. Runningquery1302 ondatabase106B yields queryresults1303 that include metadata from the desired record. The metadata is then transmitted toclient104.
In one embodiment,server105 creates a record in linkingdatabase509, containing the title, or some other unique identifier of the record indatabase106B, andTOC data1220. Thus, in the future, if thesame CD101 is inserted,server105 can query linkingdatabase509 with the TOC ofCD101 to look up the title of the album or other identifier, and then use the album title or identifier to find the desired record indatabase106B. This obviates the need for additional user input.
In an alternative embodiment, whereinserver105 has write privileges fordatabase106B, oncequery results1303 are retrieved,server105 writes the missing information (such as the TOC-based Disc ID) in the appropriate record ofdatabase106B.
In yet another embodiment, depicted inFIG. 14, oncequery results1303 are retrieved,server105 creates a new record in anauxiliary database106C, copying the record fromdatabase106B and also adding missing information such as TOC-based Disc ID. Thus, future lookups for the CD can be performed onauxiliary database106C instead of oncommercial database106B.
Referring now toFIG. 15, there is shown an example wherein database106 (which can becommercial database106B or some other database) storesalbum titles1202,artists1203, andaudio signatures1504. Anaudio signature1504 is an identifier that can be derived from audio data such as a music file on aCD101. In one embodiment,audio signatures1504 for use by the present invention are derived from audio information using the music information retrieval techniques described in related U.S. patent application Ser. No. 10/167,807 for “Music Information Retrieval,” filed Jun. 11, 2002, the disclosure of which is incorporated herein by reference. In another embodiment, audio signatures are derived from music files using any known technique of feature extraction or algorithmic processing on the digital files. One such technique, developed by Fraunhofer Institute IIS, Ilmenau, Germany, takes an average spectrum over some period of time and then derives spectral flatness measures in each of several bands. These spectral flatness measures are then quantized using a probabilistic quantizer into an 8-bit quantity. The result is a highly compressed representation of the original signal that has the property that alternative versions of the same signal will have similar signatures. Another audio signature technique is available in a product called TRM, available from Relatable, LLC of Alexandria, Va.
In the example ofFIG. 15,database106 is missing TOC-basedDisc IDs1201 for some records. One or more audio signature extracted fromCD101 is used to locate the desired record. In this example, when the TOC-based query yields no results,client104 automatically extracts an audio signature fromCD101. Based on the audio signature(s),server105 constructs query1502. Running query1502 ondatabase106 yields queryresults1303 that include metadata from the desired record. The metadata is then transmitted toclient104.
In one embodiment,server105 creates a record in linkingdatabase509, containing the audio signature(s),TOC data1220, and album title or other unique identifier. Thus, in the future, if thesame CD101 is inserted,server105 can query linkingdatabase509 with the TOC ofCD101 to look up the title of the album or other identifier, and then use the album title or other identifier to find the desired record indatabase106B. This obviates the need for additional audio signature generation or lookup.
In one embodiment,server105 creates a record in linkingdatabase509, even when a TOC-based Disc ID is present indatabase106B. Thus,database509 contains a robust set of associations between audio signatures and TOC-based Disc IDs. Such association can be useful for future queries, or for increasing the level of confidence when there is some doubt as to the accuracy of a TOC-based Disc ID. In some cases, an audio signature can map to more than one TOC-based Disc ID; for example, if an audio signature is associated with a song that appears on more than one CD (for example, on a studio release and then on a compilation album), records in linkingdatabase509 may link the audio signature for the song with TOC-based Disc IDs for each of the CDs on which the song appears. Such additional links may be useful, for example to provide a user with a list of CDs that contain a particular song of interest.
One skilled in the art will recognize that the above described techniques, and the block diagrams ofFIGS. 12-15, depict the operation of the invention in terms of an example, and that such examples are not intended to limit the scope of the claims herein. In particular, although the above examples depictcommercial database106B as missing TOC-based Disc IDs, the present invention can be applied to situations where other pieces of information are missing and/or incorrect. In addition, when TOC data is missing, other techniques for obtaining additional information may be used. For example, a scanner may read the UPC or other bar code from the CD packaging, or the user may provide some other piece of identifying information. In addition, when results of a query are ambiguous or uncertain, the user can be prompted to indicate which of several results is the desired result or to confirm a correct result. The creation of an entry in the linking database may be dependent on the confirmation by one or more users that the association is correct. One skilled in the art will recognize that additional variations are possible, without departing from the essential characteristics of the invention.
User-entered information and/or audio signatures can also be used to resolve any ambiguity resulting from variations in TOC data. It is known that TOC data can vary from one copy to another of a particular disc; these variations can result from slight differences among pressings of the disc, or from differences in CD drives reading the discs. In many cases, the above-described vector matching techniques are able to detect approximate matches and thereby select those metadata records that are most likely to be pertinent to the inserted CD. Once those close matches have been identified, it is useful to confirm that one (or more) of the close matches is in fact the appropriate record. In one embodiment, therefore, once a set of close matches indatabase106 have been found,client module104 presents the closely matching metadata record(s) to the user and prompts the user to confirm which (if any) of the records is a correct match.Server105 then creates a record in linkingdatabase509, to associate the TOC data of the inserted CD with the confirmed record(s) indatabase106. The TOC data derived from the inserted CD is referred to as a “TOC variant,” since it does not exactly match the TOC data originally associated with the metadata record. Subsequently, if the user (or another user) inserts a CD having the TOC variant,server105 can determine, from linkingdatabase509, which record(s) indatabase106 is pertinent, and retrieve the record(s) without having to confirm with the user. In one embodiment,database106 includes an indicator of a credibility level with respect to a metadata record; if more than one user corroborates the association of the TOC variant with the same metadata, the credibility level of the metadata record increases.
In an alternative embodiment, rather than prompting the user for confirmation,server105 compares metadata from the closely-matching records with user-entered CD information. If the user-entered CD information matches the metadata in a record, that record is considered to be confirmed.Server105 then creates a record in linkingdatabase509, to associate the TOC data of the inserted CD with the confirmed record(s) indatabase106. Again, the TOC of the inserted CD is considered a TOC variant, and subsequently inserted CDs having the TOC variant can be recognized without additional confirmation.
In yet another embodiment,server105 compares audio signatures from the closely-matching records with an audio signature obtained from the inserted CD. If the audio signature obtained from the inserted CD matches the audio signatures from a closely-matching record, that record is considered to be confirmed.Server105 then creates a record in linkingdatabase509, to associate the TOC data of the inserted CD with the confirmed record(s) indatabase106. Again, the TOC of the inserted CD is considered a TOC variant, and subsequently inserted CDs having the TOC variant can be recognized without additional confirmation.
In any of the above-described embodiments, instead of creating a record in linkingdatabase509, ifserver105 has write privileges fordatabase106, it can create one or more new records indatabase106 to associate the TOC variant with the metadata from the matching metadata record(s). Alternatively,server105 can create one or more such records in anauxiliary database106C to associate the TOC variant with the metadata from the matching metadata record(s). Alternatively, ifserver105 has write privileges fordatabase106, it can add the TOC variant to existing metadata record(s) indatabase106, so thatserver105 can recognize the TOC variant without reference to any other databases.
Referring now toFIG. 5, there is shown a system for implementing the above-described techniques of linking and associating metadata from two or more sources, according to one embodiment of the present invention. In the exemplary system shown inFIG. 5, the invention uses linkingdatabase509 to link user-entered metadata stored indatabase106A with commercial metadata stored indatabase106B.Commercial metadata database106B contains, for example, content that has been purchased or licensed from a source of music information, such as for example All Media Guide (AMG) of Ann Arbor, Mich. One skilled in the art will recognize that the techniques of the present invention can be applied to metadata derived from any source.
In one embodiment, the present invention is implemented using three major system components: client software102 (including metadata client module104);metadata server105; anddisc match module507. In the following description,metadata server105 handles incoming metadata submissions, such as user-entered metadata; however, in an alternative embodiment, a separate module (not shown) handles such submissions.
Linkingdatabase509 contains mappings from TOC based Disc IDs that are used indatabase106A to record identifiers in thedatabase106B.Server105 usesdatabase509 us to find data in thedatabase106B, given a Disc ID fordatabase106A The following in an example of a data format used to store the database in a file. In this example, records are separated by lines consisting of a single period (.). A TOC Identifier is found on lines beginning with “#0+” and a “DISCID=<some number>” denotes the record identifiers indatabase106B or106A (the leading digit of the identifier indicates which database to use.)
|
| # 0+6+183+49120+78783+102020+167320+200438+227175+3029 |
| DISCID=4050f0ea |
| . |
| #0+10+182+17072+36722+51550+70497+87785+113515+138755+155952+175835+198332+22 |
| DISCID=00036b66 |
| . |
|
Disc Match module507 initially populates linkingdatabase509. It matches user-entered metadata with the metadata incommercial database106B to establish the mappings. Techniques for determining these matches are described in more detail below in connection withFIG. 8.
In one embodiment,client module104 andserver105 provide additional functionality to minimize the amount of data a user might need to enter for an unrecognized disc, as will be described in more detail below. For example, when theserver105 indicates to theclient104 that the disc is unrecognized, theclient module104 prompts the user to enter the artist name and/or title of the disc. This information is sent back toserver105, which uses it in a query against linkingdatabase509. The results of this query are returned toclient104, which displays them for the user. Should the user accept the results of the query, the disc TOC and the Disc ID are sent fromclient module104 toserver105.Disc Match module507 creates a new association between the TOC and the Disc ID in linkingdatabase509.
Databases106A and106B may be local or remote with respect to metadata application service provider (ASP)103.
ImprovedMetadata Client Module104In one embodiment, the present invention is backward compatible with existing populations ofclient software102. Furthermore, in one embodiment, theimproved client module104 of the present invention is backward compatible with existingmetadata servers105 that do not include the improved functionality set forth herein. In one embodiment, the dialog boxes described below are presented as part of a user interface ofclient module104.
Referring now toFIG. 21, there is shown a flowchart depicting details of a method of handling metadata submission according to one embodiment of the present invention. Referring also toFIGS. 20A through 20D, there are shown screen shots of dialog boxes presented by the user interface ofclient module104 in connection with the method ofFIG. 21, according to one embodiment.
A user inserts aCD2101.Client module104 queries2202metadata server105 to initiate a TOC-based search.Step2103 determines whether multiple matches, a single match, or no matches are found.
If multiple close matches or multiple exact matches are found,client module104displays2115 the matches. For example, referring also toFIG. 20D,client module104 presentsdialog box2020 to allow the user to select among the matches. Depending on the user'saction2116,client module104 proceeds to step2107,2104, or2117. The user can select a matching listing infield2021, and clickSelect button2022 to proceed to step2107 as described below. If none of the listings match, the user can click onSearch button2023 to go to step2104 as described below, or the user can click on Cancelbutton2024 to dismissdialog box2020 and indicate2117 that no metadata is to be associated with the inserted CD.
In one embodiment, if the TOC-based search yields no results,client module104 proceeds to step2104 to initiate an artist/title search, as described below.
In one embodiment, if the TOC-based search yields one match,client module104 determines2113 whether it is an exact match. If so,client module104 assumes the match is correct, and uses2114 the metadata from the matching record. If it is not an exact match,client module104 proceeds to step2107 as described below.
Instep2104,client module104 displays a dialog box, such asdialog box2000 shown inFIGS. 20A and 20B, depicting examples ofdialog boxes2000 forsearch function2104 according to one embodiment of the present invention.Dialog box2000 prompts the user to enter information inartist field2001 and/oralbum title field2002 as parameters for a search for matching metadata records, and shows search results (if any) inarea2004.
User action2105 determines the next step. If the user enters artist and/or title information infields2001 and2002 and clicks onsearch button2003 to initiate a search,client module104queries2106server105 by artist and/or title, and returns to step2104 to display results.
If, once results are displayed, the user selects one or more of the displayed search results (by highlighting the result(s) and clicking on select button2005),client module104, proceeds to step2107 as described below.
If the user clicks on Not foundbutton2006 to indicate that none of the listed results are correct,client module104 may give the user an opportunity to manually enter track information.
The user can also click on Cancelbutton2007 to dismissdialog box2000 and indicate2117 that no metadata is to be associated with the inserted CD.
Instep2107,server105 provides detailed metadata for the selected record(s) and provides the user with an opportunity to confirm the displayed metadata. Referring also toFIG. 20C, there is shown a screen shot depicting an example of adialog box2010 for confirming CD information according to one embodiment of the present invention. Metadata for the selected record is displayed, includingartist2011,album title2012, genre2013, andtrack listing2014.
The user can edit2109 the displayed metadata to correct any errors. If so,client module104 transmits2111 the edits, along with TOC, toserver105 for submission verification.Metadata server105 is then able to link the TOC with the new or existing metadata record, as described in more detail below in connection withFIG. 7.Client module104 then proceeds to use2112 the edited metadata.
If the user clicks theOK button2016 to confirm the displayed metadata as correct, without making edits,client module104 posts asubmission2110, including the TOC of the inserted CD and the Disc ID, tometadata server105.Metadata server105 is then able to link the TOC with the new or existing metadata record, as described in more detail below in connection withFIG. 7.
Client module104 then proceeds to use2112 the edited metadata.
The user can click on Cancelbutton2017 to dismissdialog box2010 and indicate2117 that no metadata is to be associated with the inserted CD.Back button2015 returns toprevious screen2000.
Query FormatIn one embodiment,client module104 communicates withserver105 using HTTP over the Internet. Requests for metadata are made using HTTP ‘GET’ requests to a uniform resource locator (URL) associated withserver105. The request includes Disc ID information, such as a listing of track lengths and other unique identifying information. Such information can be provided in any format. One skilled in the art will recognize that metadata requests can be implemented using any known techniques and formats without departing from the essential characteristics of the present invention.
Response FormatQuery ResponsesIn one embodiment, the relevant data is returned in the body of an HTML response. The first line of the body contains a code indicating the type of return. Return codes include:
- 200=exact matches
- 210=inexact matches
- 211=202=no match for disc
The format for 200 and 210 codes (multiple matches) contains a single header line with the response code (and descriptive text that should be ignored) followed by multiple lines containing the query results. The list is terminated by a line containing a single ‘.’. The following is an example of a response for a single exact match:
| |
| 200 Found exact matches, list follows (until terminating ‘.′) |
| DINDEX0=404782b7 |
| DTITLE0=Crash |
| DARTIST0=Dave Matthews Band |
| DGENRE0=Rock/Pop |
| DYEAR0=1996 |
| DNUMBER0=1 |
| DTOTAL0=1 |
| DARTID0=hi.492215 |
| . |
| |
The following is an example of a response where multiple items is returned:
| |
| 200 Found exact matches, list follows (until terminating ‘.′) |
| DINDEX0=4054819b |
| DTITLE0=The Eminem Show |
| DARTIST0=Eminem |
| DGENRE0=Rap/R&B |
| DYEAR0=2002 |
| DNUMBER0=1 |
| DTOTAL0=1 |
| DARTID0=hi.1343899 |
| DINDEX1=40549115 |
| DTITLE1=The Eminem Show [Clean] |
| DARTIST1=Eminem |
| DGENRE1=Rap/R&B |
| DYEAR1=2002 |
| DNUMBER1=1 |
| DTOTAL1=1 |
| DARTID1=hi.1347861 |
| . |
| |
The response contains metadata designed to help the user choose the appropriate record when multiple items are returned. In general, the attributes returned are of the format:
Name<index>=Value;
where <index> is a number that binds attributes belonging to the same record together.
The following named attributes are returned:
DINDEX—The unique metadata record identifier (Disc ID) that should be used in a “read” request to return the full set of metadata
DARTIST—The name of the Artist
DTITLE—The title of the Album
DGENRE—The genre associated with the album.
DYEAR—The year the album was first released
DNUMBER—The disc number if this is part of a multi-disc set
DTOTAL—The total number of discs if this is a multi-disc set.
DARTID—An identifier that can be used to retrieve the album art image using a different request.
Read ResponsesThe response to a read request consists of a header line followed by Name=Value pairs (one per line) and terminated by a line consisting of a single ‘.’ The header line consists of the code210, followed by the category and disc-ID that were passed to the read request. The following names are returned:
DISCID—The Id of the returned disc . . . . The same as the argument to the READ command.
DTITLE—The title of the disc . . . Typically <ARTIST>/<ALBUM>
DARTIST—The artist associated with the disc.
DALBUM—The album name associated with the disc.
DYEAR—The year of the first release of the album.
DGENRE—The genre associated with the disc. Examples include: Avant Garde, Bluegrass, Blues, Cajun, Celtic, Children's, Classical, Comedy, Country, Easy Listening, Electronica, Environmental, Exercise, Folk, Gay, Gospel, Holiday, Jazz, Latin, Marches, Miscellaneous, New Age, Rap/R&B, Reggae, Rock/Pop, Sound Effects, Soundtrack, Spoken Word, Vocal, Women's, World
TTITLE<N>—The track title for track <N> where <N> is from 0 to the number of tracks on the disc −1.
TARTIST<N> (protocol5 only)—The artist associated with track <N> where <N> is from 0 to the number of tracks on the disc −1. Multiple artists are separated by “/”.
An example of a read response is as follows:
| |
| 210 category 0003851f |
| DTITLE=Get Rich Or Dye Tryin' |
| DARTIST=50 Cent |
| DYEAR= |
| DGENRE=Rap/R&B |
| DNUMBER= |
| DTOTAL= |
| DALBUMID= |
| DARTISTID= |
| DARTID= |
| TTITLE0=Intro |
| TARTIST0=50 Cent |
| TTRACKID0= |
| TTITLE1=What Up Gangsta? |
| TARTIST1=50 Cent |
| TTRACKID1= |
| TTITLE2=Patiently Waiting |
| TARTIST2=50 Cent ft Eminem |
| TTRACKID2= |
| TTITLE3=Many Men (Wish Death) |
| TARTIST3=50 Cent |
| TTRACKID3= |
| TTITLE4=In Da Club |
| TARTIST4=50 Cent |
| TTRACKID4= |
| TTITLE5=High All the Time |
| TARTIST5=50 Cent |
| TTRACKID5= |
| TTITLE6=Heat |
| TARTIST6=50 Cent |
| . |
| |
Submission FormatIn one embodiment, metadata submissions are processed via an HTTP POST. The posted data is in the same format as the read response. An example of a submission post is as follows:
| |
| # Proto: 5 |
| # hello:Source=MMJB+MMJB_KEY=&MMUID={12345678-ABCD-1234-ABCD- |
| 1234567890AB}&grant=1&VERSION=7.20.0169Gateway&OEM=Gateway&OOEM=Gateway&LANG=ENU |
| # MMJB::Category: Soundtrack |
| # MMJB::Discid: 0 |
| # Track frame offsets: |
| # 150 |
| # 22790 |
| # 45523 |
| # 51973 |
| # 68145 |
| # 82055 |
| # |
| # Disc length: 1300 seconds |
| # |
| DISCID=0 |
| DTITLE=8 Mile [Deluxe Edition] (2 of 2) |
| DGENRE=Soundtrack |
| DARTIST=50 Cent |
| TTITLE0=Rap Name |
| TARTIST0=Obie Trice |
| TTITLE1=Stimulate |
| TARTIST1=Eminem |
| TTITLE2='Til I Collapse Freestyle |
| TARTIST2=50 Cent |
| TTITLE3=Gangsta |
| TARTIST3=Beast, Joe |
| TTITLE4=The Weekend |
| TARTIST4=Brooklyn |
| TTITLE5=California |
| TARTIST5=Shaunta |
| |
One skilled in the art will recognize that the above formats and layouts are merely examples, and that the present invention can be practiced with any other format or layout for queries, responses, and submissions.
Improved Metadata Server105In one embodiment,server105 provides new services and functionality to next-generation clients while maintaining backward compatibility with existing clients. To achieve this,server105 implements and responds to existing metadata request protocols, as well as a new protocol, described below, to support the enhanced features and metadata for next-generation clients.
Upon receiving a metadata request,server105 attempts to use metadata fromcommercial database106B when available. Ifdatabase106B does not contain the requested metadata,server105 uses user-enteredmetadata database106A as a fallback. For example, for new CDs that have not yet been entered incommercial database106B, or for promotional discs, imports, and bootleg CDs that may never find their way intocommercial database106B,server105 relies on user-entereddatabase106A.
In addition, if a user indicates that metadata fromdatabase106B contains an error, the corrected information can be stored indatabase106A. When responding to future requests,server105 provides the corrected metadata.
In order to implement such functionality,server105 matches metadata requests fromclient modules104 with records indatabases106A and106B, and further associates and links records indatabase106A with records indatabase106B. In this manner,server105 is able to identify records in both databases that refer to the same CD.Server105 uses information from linkingdatabase509 to perform such associating and linking. The following are examples of methods and implementations for matching, associating, and linking metadata requests,database106A records, anddatabase106B records. One skilled in the art will recognize that these techniques can be applied in any context where matching, associating, and linking of two or more database records are desired.
CD Metadata Submission ProcessingReferring now toFIG. 7, there is shown an improved server method for adding metadata information to a database, according to one embodiment of the present invention. The improved method of the present invention uses user submissions to create new Disc ID to Album ID mappings, so as to resolve ambiguities and to identify potential errors in the commercial metadata.
In one embodiment,server105 performs the steps of the method ofFIG. 7 when a user manually enters metadata. For example, ifserver105 fails to find a matching record indatabase106A or106B in response to a metadata query,client software102 may prompt the user to manually enter metadata.Client module104 then transmits the metadata toserver105, which then performs the steps ofFIG. 7.
Metadata server105 receives701 metadata, typically via HTTP over an Internet connection.Server105 assesses the type ofsubmission702. If the submission represents a correction to existing metadata, in oneembodiment server105 submits707 the corrections for editorial review (either manual, automated, or both). Based on the results of the editorial review, the correction is applied to theappropriate database106A or106B, or it is rejected. In one embodiment, ifserver105 is not able to write tocommercial database106B, the correction may be entered in user-entereddatabase106A. Subsequently, whenever that metadata record is to be retrieved, the corrected metadata is retrieved from user-entereddatabase106A, in lieu of or in addition to the commercial metadata fromdatabase106B.
If the received submission represents new metadata,server105 attempts to match704 the submitted metadata to a record incommercial database106B. In one embodiment, the vector matching techniques described above are employed. In another embodiment, the text of the tracks is matched with candidate albums from the commercial database using an Information Retrieval index. If a match is found708,server105 creates709 a new TOC variant indicating a link between the detected TOC and the Disc ID of the matching record, and stores the new TOC variant as a record in linkingdatabase509. If no match is found,server105stores710 the new metadata as a new record in user-entereddatabase106A.
Additional detail for an implementation of the steps ofFIG. 7 according to one embodiment are provided in connection withFIG. 19, below.
CreatingLinking Database509In one embodiment, the inventionpre-populates linking database509 so thatserver105 can use commercial data where available, but fall back on user-entered data when commercial data is unavailable or inaccurate. In this manner, the invention provides more accurate information while avoiding the need to rely on users to enter a large amount of data.
In one embodiment, the invention uses TOC data from a provider of commercial metadata, such as AMG.Disc match module507 creates a vector for the TOC data provided by AMG, maps it to the associated album ID, and stores the mapping indatabase509 so thatserver105 can retrieve the corresponding record fromdatabase106B when needed.
Alternatively, the invention uses a known collection of CDs to pre-populatedatabase509.Disc match module507 extracts TOC data from these CDs. Ifcommercial database106B includes universal product code (UPC) numbers, the CD TOC data is mapped to the commercial database using UPCs on individual disc cases. In one embodiment, a barcode reader is used to speed up the process of inputting UPC numbers.
Referring now toFIG. 8, there is shown an example of a method for creating mappings between TOC Based Disc IDs and CDs, as performed bydisc match module507 according to one embodiment of the present invention. The method is shown in the context of populatinglinking database509 using a collection of CDs. One skilled in the art will recognize, however, that the method can be adapted to populatedatabase509 using data from other sources.
The user insertsCD101 into a drive. A component ofclient software102extracts801 the TOC from the disc and forms802 a vector as described above. A barcode reader (not shown) scans803 the UPC barcode on the CD package. Alternatively, the user could be prompted to enter a title, code, or other information that uniquely identifies the CD. Alternatively, the system extracts audio signatures for one or more songs on the CD.Disc match module507 uses the scanned UPC, or audio signatures, or other information, to retrieve an album ID or metadata fromcommercial database106B.Module507 then generates806 a record associating the vector with the retrieved album ID and/or metadata. The generated record is stored indatabase509.
Metadata Server105In one embodiment,metadata server105 is implemented as an application server that accepts requests delivered via HTTP, performs the metadata lookups, and returns the data via an HTTP response. For example,metadata server105 may be implemented as an instance of Tomcat, an application server available from the Apache Software Foundation of Forest Hill, Md. The Tomcat instance is extended with Java servlets to implement the functionality of the present invention.
Referring now toFIG. 9, there is shown an example of an implementation of metadata ASP130 includingmetadata server105 according to one embodiment of the present invention. Metadata is organized into memory-mappedfiles905 andindexes907. Eachindex907 associates a vector with an offset into a memory-mappedmetadata file905. The records inmetadata file905 contain the information needed to fulfill a query request. There may be multiple metadata records associated with a single offset in the case of ambiguous results for a Disc ID.
In one embodiment, there aremultiple indexes907, such as for example:
- 1. Commercial TOC vectors mapping to commercial metadata;
- 2. User-submitted TOC vectors mapping to commercial metadata;
- 3. User-submitted TOC vectors mapping to user-submitted metadata;
- 4. Album title and artist name text mapping to commercial metadata; and
- 5. Album title and artist name text mapping to user-submitted metadata.
In one embodiment, when retrieving metadata according to the techniques described above in connection withFIG. 6,server105 attempts to find exact vector matches in eachindex907 in a specific order (for example, first scanning commercial TOC vectors mapping to commercial metadata, then scanning user-submitted TOC vectors mapping to commercial metadata, then scanning user-submitted TOC vectors mapping to user-submitted metadata).Server105 returns the metadata for the first exact match found. In one embodiment, the scanning process involves calculating the vector distance between the query vector and each vector in the database. An optional performance optimization is to skip the vector difference calculation if the total disc length in seconds as reported in the TOC data does not match within a given configuration threshold. An optional performance optimization is to skip the vector difference calculation if the number of tracks reported in the TOC data does not match. An “exact” match is a match for which the vector difference is smaller than a predetermined threshold.
User submissions903 are sent to asubmissions database906 for processing by asubmission processing module904 according to the techniques ofFIGS. 7 and 19.
Metadata Indexes907Metadata indexes907 are constructed whenmetadata server105 starts up. They are updated periodically as data is appended to user-enteredmetadata database106A (based on new submissions).Multiple indexes907 are created in order to define the search order of the metadata. The indexes are created, for example, by reading the contents of thecommercial metadata database106B and user-enteredmetadata database106A. There may be multiple files for each of these sources corresponding to different commercial providers, different languages, or different levels of credibility for user-submitted data (corroborated and uncorroborated).
In one embodiment, two types ofindex907 are created: a TOC index and a text index. A TOC index is created by reading the metadata files and creating a TOC vector for each TOC string that is encountered in the file, and associating it with the Disc ID of the associated record. The index can be created as a simple list of (TOC, Disc ID) object pairs. A scanning/retrieval operation is then performed, consisting of iterating through the list of TOCs in the index and performing a vector comparison with the query TOC. The scanning/retrieval operation returns the Disc IDs associated with the TOCs that compare most closely to the query.
The second type of index is a text- or word-based index that is used for artist/title text-based searches. This is a standard text-based information retrieval (IR) index that is well known in the art of text searching applications. The index consists of data structures that map the words found in the artist and album titles inmetadata database106 to the Disc IDs with which they are associated. A title query on the index consists of finding those Disc IDs that are associated with the most words matching the queries.
Other types of indexes can be built based on available linking data. For example, an audio signature index could be built that maps the combination of audio signature and position from which the signature was taken to Disc IDs.
Disc ID FormatIn one embodiment, the Disc ID is an 8-digit hexadecimal number that is mapped to the metadata for a record. The most significant bit or bits are used to segment the ID space for multiple data sources. For example, a Disc ID having a most significant bit value of 1 might correspond to a record fromcommercial database106B, while a record where the most significant bit value is 0 might correspond to a record from user-entereddatabase106A. In general, the Disc ID is opaque toclient module104 andserver105; the data source defines the Disc IDs, andclient module104 andserver105 use them with no knowledge of how they were generated.
The Disc ID encodes the byte offset of the desired record in the metadata file. Records are word-(16 bit) aligned, so that addresses are even. The least significant bit is used to indicate if the record should be returned fromcommercial metadata file106B (lsb=0), or user-enteredmetdata file106A (lsb=1). The Disc ID is opaque to the client.
Metadata File FormatThe following is an example of a metadata file format formetadata files106A and106B. One skilled in the art will recognize that any file format may be used.
Metadata files106A and106B consists of CD metadata records. In one embodiment, the format of records infiles106A and106B is compatible with the CDDB1 protocol response.
The comment header present in the XMCD files is not present in these records except for an optional revision number.
A line consisting of only a period ‘.’ is used to separate multiple records.
In one embodiment, the Disc ID used in the file is not the CDDB1 ID, but rather a unique ID derived for each metadata record in a data source such that the ID space does not overlap with any other data source.
|
| #0+10+150+24948+50538+78725+92010+123080+155748+198053+211775+235358+4519 |
| DISCID=40460e60 |
| DTITLE=Acid Jazz, Vol. 2 |
| DARTIST=Various Artists |
| DYEAR=1995 |
| DGENRE=Jazz |
| DNUMBER=1 |
| DTOTAL=1 |
| DARTID=396896 |
| DALBUMID=396896 |
| TTITLE0=Super Bad |
| TARTIST0=Idris Muhammad |
| TTRACKID0=8558080 |
| TTITLE1=Cold Sweat |
| TARTIST1=Bernard “Pretty” Purdie |
| TTRACKID1=8558081 |
| TTITLE2=Wildfire |
| TARTIST2=Rusty Bryant |
| TTRACKID2=8558082 |
| TTITLE3=Hot Barbecue |
| TARTIST3=Jack McDuff |
| TTRACKID3=8558083 |
| TTITLE4=Reelin' With the Feelin' |
| TARTIST4=Charles Kynard |
| TTRACKID4=8558084 |
| TTITLE5=Spinky |
| TARTIST5=Charles Earland |
| TTRACKID5=8558085 |
| TTITLE6=Who's Gonna Take the Weight? |
| TARTIST6=Melvin Sparks |
| TTRACKID6=8558086 |
| TTITLE7=Mamblues |
| TARTIST7=Bernard “Pretty” Purdie / Cal Tjader |
| TTRACKID7=8558087 |
| TTITLE8=The Twang Thang |
| TARTIST8=Billy Butler |
| TTRACKID8=8558088 |
| TTITLE9=Haw Right Now |
| TARTIST9=Patrice Rushen |
| TTRACKID9=8558089 |
| . |
|
Detailed Metadata Service ArchitectureReferring now toFIG. 16, there is shown a block diagram of a detailed system architecture according to one embodiment of the present invention. As described above,client module104 issues metadata queries, and transmits metadata submissions, tometadata server105.FIG. 16 provides additional detail as to the processing of metadata submissions once they are received byserver105, according to one embodiment.
Upon receipt of a metadata submission,server105 writes the submitted metadata to logfile1601.Log data collector1604 filters and collects the log information fromlog file1601, and periodically posts the data to logconsolidator1605. In embodiments where more than onemetadata server105 are provided,log consolidator1605 consolidates events that have been processed ondifferent servers105. For example,consolidator1605 accepts events from multiplelog data collector1604 instances, orders the events by user and time, and passes the events to corroborator1603 in a convenient form.
Log consolidator1605 transmits consolidated data tocorroborator1603, which determines whether submitted metadata has been corroborated by two or more independent sources.Corroborator1603 publishes corroborated information to user-entereddatabase106A. If discrepancies are detected between submissions from two or more sources,corroborator1603 identifies the discrepancies and, in one embodiment, provides a list of discrepancies to Quality Assurance (QA)tool1606, for presentation to an administrator. In one embodiment, two or more user-entereddatabases106A are provided: one for uncorroborated metadata, and one for corroborated metadata. In another embodiment, onedatabase106A is provided, and records are flagged to indicate whether the metadata they contain is corroborated or uncorroborated. As described above,server105 favors corroborated metadata over uncorroborated metadata, so as to provide toclient module104 metadata of the highest possible level of credibility.
In one embodiment,Quality Assurance Tool1606 is available to allow administrators to manually inspect and modify submission information that has been flagged for special attention.QA tool1606 may provide, for example, a user interface that allows an editor to view submissions that conflict with previously stored metadata, or to examine other submissions that are flagged for manual review. The user interface ofQA tool1606 provides administrator controls for ignoring/deleting submissions, applying submissions to correct previously stored data, or defer action pending additional information. The user interface may also provide the administrator with the means to provide comments as appropriate.
Databuild module1607 keeps thecommercial database106B in sync with user-entereddatabase106A.Databuild module1607 processes updates todatabases106, and produces new metadata records fordatabases106. Providers of commercial sources of data are constantly updating their data with new and corrected entries. The format of a commercial database may vary from one commercial source to another.Databuild module1607 processes these updates in the vendor-specific formats. In one embodiment,databuild module1607 consolidates the data from multiple feeds such that data for the same artist or album is indexed by a single, unique identifier regardless of which source provided the data.Databuild module1607 also republishes the commercial data in the format described above fordatabase106B, that is suited for use bymetadata server105. As new data is published by the commercial providers and becomes available to the system, duplication between the commercial and user-submitted data may occur.Databuild module1607 eliminates redundant entries from the user entereddatabase106A.Databuild module1607 also merges approved corrections intodatabases106, in accordance with the methods described below in connection withFIG. 19.
Log data collector1604,log consolidator1605,corroborator1603,QA tool1606, anddatabuild module1607 may be implemented, for example, as software components ofmetadata ASP103.
Referring now toFIGS. 17A,17B, and17C, there are shown diagrams depicting the software structure of one embodiment ofserver105 in more detail. Referring now toFIG. 17A, there is shown the software objects in one embodiment ofserver105, specifically an embodiment based on a Java servlet environment such as the Apache Tomcat environment.HttpServlet object1701 accepts HTTP requests through the Tomcat or other servlet environment. This object is extended to handle the specific protocols described above.CDIIndex object1702 manages the various types ofmetadata indexes907.TocDB1705 implements a TOC-based metadata index.Index1706 implements a title (word) based index.CDIIndex1702 uses an add( )method to add new entries to the indexes (as it does on startup when it reads the metadata databases).CDIIndex1702 uses find( ) to query the indexes using TOC or words as appropriates for the type of index.CDIDataStore1703 encapsulatesmetadata databases106A and106B. Underlying this encapsulation isMappedByteBuffer object1704 which provides access to the memory-mapped file implementation ofdatabases106.
Referring now toFIGS. 17B and 17C, there are shown the class inheritance relationships used in one embodiment ofserver105. The various types of requests handled byserver105 are encapsulated byobjects1713,1714, and1715 inFIG. 17B. The various types of responses generated byserver105 are encapsulated byobjects1721,1722,1723, and1724 inFIG. 17C.
Referring now toFIG. 18, there is shown a block diagram depicting the operation oflog consolidator1605 in more detail.Server105 produceslog file1801 from incoming submissions. There may be many instances ofserver105 running on different machines. Each machine running aserver105 has a daemon program called snarfd1802 which reads a log file as it is written and transmits the data over anetwork connection1803 toCorroborator Server1804, which stores the data1809.Corroborator Server1804 also maintains system monitoring statistics referred to asCricket data1810. Cricket is an OpenSource software package for monitoring and graphing. A daemon process in.snarfcricket1811 reads the Cricket data periodically and posts the data toCricket server1812 which runs Cricket1814 and maintainsgraphs1813 of the system status.
Metadata Submission MethodReferring now toFIG. 4, there is shown a flowchart depicting a server method for adding metadata information todatabase106, according to one embodiment of the present invention.Server105 receives401 a submission request fromclient102.Server105 may perform402 Quality Assurance on the submission, in order to check for completeness, spelling, case usage, and the like. If the submission passesQA403, the new information is stored405 indatabase106. Submissions that do not pass the QA process are discarded404.
Referring now toFIG. 19, there is shown a flowchart depicting a method of handling metadata submission according to another embodiment of the present invention. In one embodiment, the steps ofFIG. 19 are performed bycorroborator1603; in another embodiment, they are performed byserver105 or some other component of the system. In one embodiment the steps ofFIG. 19 are performed automatically when the user enters metadata for an inserted CD. In another embodiment, the steps are performed in a batch mode, after a number of submissions have been collected inlog1601 and are ready for processing.
In the flowchart ofFIG. 19, and in the following description, the term “provisional” is used to indicate a database record that has not been corroborated, and the term “live” is used to indicate a database record that has been corroborated or is part of a commercial database.
First, the system of the invention performs searches ondatabases106, to find1901 matches both by TOC-based Disc ID for the inserted CD and by text on the user-entered information. A text search, in this case, compares the text comprising the metadata of a submission to existing entries in thedatabases106. Specifically, the comparison seeks to determine if the submission contains the same track list, independent of punctuation, capitalization, or other insignificant differences. Users can submit information in a variety of ways, whether or not they entered the data or changed it. Small changes in punctuation or capitalization are ignored. Exact matches and approximate matches are identified, and the steps ofFIG. 19 are performed for each.
If the TOC of the inserted CD exactly matches that of a live (corroborated)record1902, and the user-submitted metadata exactly (or very closely) matches the metadata of therecord1903, the system ignores1904 the submission as redundant. The system then proceeds1905 to the next hit (if any).
If the TOC of the inserted CD exactly matches that of alive record1902, and the user-submitted metadata is close to the metadata of therecord1906, the system marks1907 the submission for manual review viaQA tool1606. The system then proceeds1905 to the next hit (if any).
If the TOC of the inserted CD matches that of a provisional (uncorroborated)record1908, and the user-submitted metadata matches or is close to the metadata of theprovisional record1906, the system upgrades1910 the provisional record to live (corroborated) status. The system then proceeds1905 to the next hit (if any).
If the user-submitted metadata matches or is close to the metadata of alive record1911, the system creates1912 a new provisional record indicating a TOC variant corresponding to the metadata of the live record. The system then proceeds1905 to the next hit (if any). In some cases, a submission includes only a TOC and a matching Disc ID. This is a TOC variant submission and occurs, for example, when the user accepts the result of atext search2010, as described above in connection withFIG. 21. In general, TOC variant submissions occur whenever a lookup succeeds (i.e., is accepted by the end user) and the lookup was a result of inexact TOC match or a non-TOC basedmetadata index907.
Otherwise, the system creates1913 a new provisional record including the user-submitted metadata of the live record. The system then proceeds1905 to the next hit (if any).
In one embodiment,databases106 are not updated in real time. Rather, all updates are performed on in-memory structures within consolidator1605 (or other component), anddatabases106 are periodically updated from the in-memory data.
The following table summaries the actions performed in the various contingencies depicted inFIG. 19:
| | Near/ | | | Near/ | Near/ | |
| | Exact | Miss | Miss | Exact | Exact | Miss |
| Prov TOC | Live TOC | Exact | Exact | Near | Near | Miss | Miss |
|
| Exact | Exact | A | A | B | B | C | D |
| Near/Miss | Exact | A | A | B | B | D | D |
| Near/Miss | Miss | E | E | E | E | D | D |
| Exact | Miss | C | E | E | C | C | D |
|
| Figure | | |
| Code | Element | Condition | Comment |
|
| A | 1904 | Redundant data | Ignore; already inlive |
| | | database |
| B |
| 1907 | Redundant data | Manually review |
| C | 1910 | Corroboration | Additional data for |
| | | provisional entry |
| D |
| 1913 | New Entry Altogether | Any TOC match is just |
| | | misleading |
| E | 1912 | New TOC variant | Add to provisional database |
|
In one embodiment, provisional (uncorroborated) records are not promoted to live (corroborated) status until some degree of credibility for the record has been established. This may entail receiving some predetermined number of matching submissions, or some type of additional verification. In addition, some users may be deemed more credible than others, based on historical accuracy and number of submissions or on other factors; a corroboration “score” may be developed for each provisional record, based on the number of corroborating submissions and/or the relative credibility of the users that provided the submissions. Score can depend on various factors, for example: whether the provisional and corroborating submissions were submitted by different users; whether the provisional and corroborating submissions came from different network (e.g. IP) addresses; whether the submissions came from users that have abused the system in the past.
Records may be promoted to live status when their score exceeds a threshold. In one embodiment, the number of corroborating submissions necessary for promotion can be predetermined, or can be varied based on, for example, whether the submission conflicts with an existing popular entry in the database.
System OperationIn one embodiment, the system of the present invention is further extended to allow retrieval of data from acommercial database106B even if TOC data is not present.
For example, if TOC data is not available, the user can manually enter descriptive information about a CD (such as artist and album name).Server105 searches for and retrieves matching records indatabases106A and106B.Client software102 then prompts the user to confirm whether or not the retrieved records are the correct records. Upon user confirmation, an association can be made between a TOC that was previously associated with the matching records indatabase106A and106B, and the user-entered information. Subsequent TOC searches would then return the metadata for the album directly. In one embodiment, the user-entered information must be corroborated before it is made available as live metadata.
Alternatively, the present invention can be combined with an audio signature algorithm that matches audio signal data. Audio signatures for CD tracks can be associated with the album metadata indatabases106A and106B.Server105 can make such associations any time metadata and TOC data are or become known but audio signatures are not known in the central database. Thus, if a TOC search finds the album, the audio signatures for the tracks in the search can be added to album metadata. If an album and artist name search is used to associate TOC data with metadata, then the audio signatures can be associated at the same time.
Referring now toFIG. 22, there is shown a block diagram of system operation in connection with audio signatures, according to one embodiment of the invention.
Client module104 issues CD lookup queries to themetadata server105.Metadata server105 looks for query results, and in the case of a match, communicates the set of tracks available for ASi reference signature collection to ASi Inventory Manager2202. ASi Inventory Manager2202 responds with zero, one, or more tracks for which reference signatures should be collected.Metadata server105 annotates the response with an indicator toclient module104 of the track(s) for which signatures should be collected.Client module104 collects the requested signatures, for example during a rip operation, and logs them, for example, using a Quality of Service (QoS)logging subsystem2200.QoS logging subsystem2200 uploads the collected signatures to sink2203.Sink2203 receives and stores uploaded log files2204.
ASi Corroborator2205 processes logs as they are received.ASi Corroborator2205 classifies the received data as either provisional or corroborated and stores the data in theappropriate database2206 or2207.ASi Corroborator2205 also notifies ASi Inventory Manager2202 that a signature was received. ASi Inventory Manager2202 updates its inventory of received signatures.
Periodically,ASi Analyst2208 analyzes the collected signatures.ASi Analyst2208 transforms the reference signatures fromdatabases2206 and2207 into a format suitable for doing ASi retrieval, and stores the result indatabase2209. Whenmodule104 transmits a query request, an audio signature may be passed as part of the request.Server105 passes the signature toASi Retrieval server2210 to find matching tracks for the signature indatabase2209.Server105 consolidates the results and returns them toclient module104.
Subsequently, when a track from a known album is encountered that has either non-standard or non-existent metadata tags, it can be recognized using the audio signature. In addition, results from the TOC search can be disambiguated using the audio signatures as they are associated with albums.
Alternatively, as tracks with non-standard tagging are encountered, if they can be identified using an audio signature, then the non-standard tags can be recorded and associated with the album. Subsequently, when data is acquired by other means when only the non-standard tag is available and the audio signature is not available, the track can still be identified accurately. In one embodiment, this association between tracks is done by semi-automated means. In some instances, only artist and track names may be available (as in the file names for tracks distributed by peer-to-peer services). If, however, a track with identical tags has previously been identified using audio signatures, then the presence of identical tags can be taken as strong evidence about the identity of the track. The result is that the current semi-automated track tag equivalencing process can be fully automated. This makes the input data for recommendation engines much more useful because the data can be referenced back to rich data sources which associate primary artists, other performers, albums, and tracks.
In the above description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the invention. It will be apparent, however, to one skilled in the art that the invention can be practiced without these specific details. In other instances, structures and devices are shown in block diagram form in order to avoid obscuring the invention.
Reference in the specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the invention. The appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment.
Some portions of the detailed description are presented in terms of algorithms and symbolic representations of operations on data bits within a computer memory. These algorithmic descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. An algorithm is here, and generally, conceived to be a self-consistent sequence of steps leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.
It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the discussion, it is appreciated that throughout the description, discussions utilizing terms such as “processing” or “computing” or “calculating” or “determining” or “displaying” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system's memories or registers or other such information storage, transmission or display devices.
The present invention also relates to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus.
The algorithms and displays presented herein are not inherently related to any particular computer, network of computers, or other apparatus. Various general-purpose systems may be used with programs in accordance with the teachings herein, or it may prove convenient to construct a more specialized apparatus to perform the required method steps. The required structure for a variety of these systems appears from the description. In addition, the present invention is not described with reference to any particular programming language. It will be appreciated that a variety of programming languages may be used to implement the teachings of the invention as described herein.
As will be understood by those familiar with the art, the invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. For example, the particular architectures depicted above are merely exemplary of one implementation of the present invention. The functional elements and method steps described above are provided as illustrative examples of one technique for implementing the invention; one skilled in the art will recognize that many other implementations are possible without departing from the present invention as recited in the claims. Likewise, the particular capitalization or naming of the modules, protocols, features, attributes, or any other aspect is not mandatory or significant, and the mechanisms that implement the invention or its features may have different names or formats. In addition, the present invention may be implemented as a method, process, user interface, computer program product, system, apparatus, or any combination thereof. Accordingly, the disclosure of the present invention is intended to be illustrative, but not limiting, of the scope of the invention, which is set forth in the following claims.