TECHNICAL FIELDThe present invention relates to restoration of a backup of a first volume to a second volume on physical media.
BACKGROUNDComputers and computer networks face a myriad of threats to the information they contain and to their smooth and continuous operation. Backup systems and methods have been proposed for maintaining snapshots of information on a computer or a computer system at any given time. With a backup, if the computer system experiences some type of corruption of its stored information the state of the computer can be restored to the point in time when the backup was made.
Computer systems typically maintain sets of files that are stored on various forms of computer media. The locations of the blocks of data, on the media that contain the files' data, are stored in special data structures called file system metadata (or simply metadata) and these metadata, combined with the files' own data, are collectively called a file system. The blocks of data for an entire file system are often contained within a logical construct called a volume, which is a collection of media blocks that may span across one or more media devices. Formatting a volume is a process of writing file system metadata structures to blocks on the underlying volume. File systems generally contain a recognizable portion called a signature within their metadata that makes them recognizable, as a file system, to an operating system. When the operating system recognizes a file system on a volume, the operating system may mount the volume to expose the volume's file contents, for use by the system's users and applications. If the operating system cannot recognize the file system on a volume then it will typically indicate that the volume is unformatted, unrecognized, or a combination thereof. When a volume is unrecognized by the operating system, its contents generally cannot be altered by the user (unless they format the volume, rendering it recognizable), or by typical applications.
To protect computer data, various techniques have been developed to backup that data. One common backup technique involves the creation of backup copies of files on a per-file basis. This technique is often simply called file backup. Many backup procedures use the file system as a starting point and perform the backup by writing the files to a backup storage system. Using this approach, individual files are backed up if they have been modified since the previous backup. This file system backup procedure may be useful for finding and restoring a few lost or corrupted files, but it may include significant overhead in the form of bandwidth and logical overhead to store information about where each file exists within the file system. In addition, complete reconstruction of a storage volume may be difficult using incremental file backups because files that have been deleted may be put back on the volume based on the contents of the incremental backups.
Another technique ignores the individual files and instead backs up the blocks of data of the media on which the file system resides. This technique is often called disk image backup (or image backup, or block backup) because the backup file itself generally contains or represents an image, or copy, of the entire contents of the volume (computer users often loosely call their volumes “Disks” despite the fact that an actual physical media disk may contain more than one volume, which is often the case when a hard disk drive contains multiple partitions, each partition hosting a volume) which may be backed up. In fact, with the disk image backup technique, the backup itself is generally called a backup image (or simply image). Because it backs up the blocks on the media underlying the file system, the disk image backup technique backs up both the files' data and the file system metadata. File backup does not generally backup the file system metadata. Because the disk image backup technique backs up media blocks rather than files, this technique does not necessarily need to be aware of the file system structures or the files contained in the file system. However, in practice, many disk image backup systems may utilize minimal knowledge of the file system in order to only backup blocks of data that are actually in use on the media, ignoring unused blocks.
A disk image backup can be relatively fast because the file structure does not need to be examined. Conversely, because there is no coordination with the file system, no file changes can be made during the backup copy. In addition, retrieving files from the backup copy may be difficult or impossible without restoring the entire volume. Finally, while the data transfer may be fast with block copies, every block with valid data must be copied, resulting in a relatively long process for large volumes (e.g., perhaps even days).
If a computer or server crashes or fails, or the software becomes corrupted, it may become necessary to restore the state of the server to some previously backed-up state. Additionally, if a user desires to migrate or upgrade from one server to another server, the new server must be brought up to the same state with the same information as the server to be replaced. Conventionally, the restore process consists of three steps: (1) the machine is usually physically shut down; (2) a backup of the latest data is created; and (3) the data is copied from the backup storage to the storage of the new server (e.g., a disk drive used by the server). During this operation, a dedicated boot procedure can be used (for example, boot from a floppy disk); and (4) the new server is started.
Between the shutdown and start time, neither server performs any useful functions. Also, the process of restoring all the files of the server is time consuming. In corporate networks, a typical server can have many Terabytes of data. With current data transfer rates over networks and from disks, restoring this much data may take hours, or even days.
Also, even an hour of downtime may be considered an extremely long time, particularly for many mission-critical applications. For example, if the server in question is a web server that handles customer requests or sales, a period of 16-20 minutes of downtime can result in substantial lost sales.
Systems and methods are needed for an effective backup strategy that is flexible in maintaining multiple, accurate backup states, and reducing a downtime when both a backup computer and a restore computer are not operational.
BRIEF SUMMARYIn general, embodiments of the present invention relate to restoration of a backup of a first volume to a second volume on physical media. At least some example embodiments provide an effective backup and restore strategy that is flexible in maintaining multiple backup states and can perform a restore process while the backup process is underway and reduce a downtime in which both a first volume and a second volume are not operational.
In one example embodiment, a method for restoration of a backup of a first volume to a second volume on physical media includes accessing a volume boot record of the second volume while the volume boot record is in a valid state on the physical media, modifying the volume boot record to place the volume boot record into an invalid state on the physical media, writing data from the backup of the first volume to the inaccessible portion of the data segment of the second volume while the volume boot record is in the invalid state, and restoring the volume boot record to the valid state. In this example embodiment, the valid state enables a standard operating system to access a data segment of the second volume and the invalid state prevents the standard operating system from accessing at least a portion of the data segment of the second volume.
BRIEF DESCRIPTION OF THE DRAWINGSIn the drawings:
FIG. 1 illustrates a simplified example of a computing system for practicing embodiments of the present invention;
FIG. 2 illustrates segmenting of a target volume and incremental backups of portions of the target volume;
FIG. 3 is a simplified flow diagram illustrating backup procedures for creating synthetic base backups of a target volume;
FIG. 4 is a simplified flow diagram illustrating backup procedures for creating synthetic incremental backups of a target volume;
FIG. 5 is a simplified flow diagram illustrating backup procedures for creating synthetic incremental backups and synthetic base backups of a target volume;
FIG. 6 is a simplified block diagram of a file control stack for an operating system;
FIG. 7 is a simplified diagram of a disk and control structures thereon;
FIG. 8 is a simplified diagram of a restore volume with an invalidated volume boot record;
FIG. 9 is a simplified diagram of a restore volume defined with a small file structure thereon;
FIG. 10 is a simplified block diagram of a software stack for accessing block media according to embodiments of the present invention;
FIG. 11 is a simplified block diagram of a backup and restore system;
FIG. 12 is simplified flow diagram of a backup and restore process; and
FIG. 13 is a simplified block diagram of a virtual machine using virtual disks.
DETAILED DESCRIPTIONIn the following description, elements, circuits, and functions may be shown in block diagram form in order not to obscure the present invention in unnecessary detail. Conversely, specific implementations shown and described are exemplary only and should not be construed as the only way to implement the present invention unless specified otherwise herein. Additionally, block definitions and partitioning of logic between various blocks is exemplary of a specific implementation. It will be readily apparent to one of ordinary skill in the art that the present invention may be practiced by numerous other partitioning solutions. For the most part, details concerning timing considerations, and the like, have been omitted where such details are not necessary to obtain a complete understanding of the present invention and are within the abilities of persons of ordinary skill in the relevant art.
Furthermore, in this description of embodiments of the invention, reference is made to the accompanying drawings which form a part hereof, and in which is shown, by way of illustration, specific embodiments in which the invention may be practiced. The embodiments are intended to describe aspects of the invention in sufficient detail to enable those skilled in the art to practice the invention. Other embodiments may be utilized and changes may be made without departing from the scope of the present invention. The following detailed description is not to be taken in a limiting sense, and the scope of the present invention is defined only by the appended claims.
Those of skill in the art would understand that information and signals may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits, symbols, and chips that may be referenced throughout the above description may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields or particles, or any combination thereof. Some drawings may illustrate signals as a single signal for clarity of presentation and description. It will be understood by a person of ordinary skill in the art that the signal may represent a bus of signals, wherein the bus may have a variety of bit widths and the present invention may be implemented on any number of data signals including a single data signal.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm acts described in connection with embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps are described generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the embodiments of the invention described herein.
The various illustrative logical blocks, modules, and circuits described in connection with the embodiments disclosed herein may be implemented or performed with a general purpose processor, a special purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
Also, it is noted that the embodiments may be described in terms of a process that is depicted as a flowchart, a flow diagram, a structure diagram, or a block diagram. Although a flowchart may describe operational acts as a sequential process, many of these acts can be performed in another sequence, in parallel, or substantially concurrently. In addition, the order of the acts may be re-arranged. A process may correspond to a method, a function, a procedure, a subroutine, a subprogram, etc. Furthermore, the methods disclosed herein may be implemented in hardware, software, or both. If implemented in software, the functions may be stored or transmitted as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another.
It should be understood that any reference to an element herein using a designation such as “first,” “second,” and so forth does not limit the quantity or order of those elements, unless such limitation is explicitly stated. Rather, these designations may be used herein as a convenient method of distinguishing between two or more elements or instances of an element. Thus, a reference to first and second elements does not mean that only two elements may be employed there or that the first element must precede the second element in some manner. Also, unless stated otherwise a set of elements may comprise one or more elements.
Embodiments of the present invention include methods, computing systems, and computer-readable media, for providing an effective backup and restore strategy that is flexible in maintaining multiple backup states and can perform a restore process while the backup process is underway and reduce a downtime when both a backup computer and a restore computer are not operational.
The term “boot record” as used herein means metadata on a storage medium that makes the storage medium recognizable to an operating system as a storage medium that contains a file structure accessible by file system software. If the boot record contains valid information about the file structure, then an operating system can recognize and expose the file structure. This exposure of the file structure contained on a volume's storage medium is called “mounting,” and it is this mounting process that enables users and applications to utilize the files stored on the storage medium using file system software. Thus, as a non-limiting example, a Volume Boot Record (VBR) is a boot record on a volume that makes the volume recognizable by an operating system as containing a file structure.
The term “private block access” as used herein means accessing block storage medium through a private Input/Output (IO) interface configured for block accesses that is available to embodiments of the present invention, but is not generally available to applications or general operating system processes. Similarly, a private block access process is a process that uses one or more private block accesses to perform IO operations to the block storage medium.
The term “public block access” as used herein means accessing block storage medium through an IO interface configured for direct block access to the block storage medium that is exposed to applications and operating system processes.
The term “block access” as used herein means a private block access or a public block access.
The term “file access” as used herein means accessing block storage medium through a file system that translates a file access request from applications or general operating system processes to one or more public block accesses.
Embodiments of the present invention deal with disk image backup techniques. Therefore, unless specified otherwise, references to backup imply backup using a disk image backup technique.
One challenge faced by backup systems is the need to be able to capture the state of the data for all of the blocks on a volume at a given moment, without locking or interrupting access to the volume by applications and users, despite the fact that the volume may be in use and changes may be in the process of being written to the volume. Techniques to expose such a volume state (often called a snapshot state or simply a snapshot), without affecting the usability of the volume itself, may be referred to as volume snapshot or volume shadow copy techniques. Both of these backup techniques often leverage snapshot technology in order to backup data on in-use volumes without interrupting users and applications.
Another challenge faced by backup systems is the need to make periodic backups in order to capture any changes since prior backups. The first time a volume is backed up, the resulting backup is typically a file called a full backup image, which contains all (or all in-use) blocks on the volume at the time the backup was created. When the next backup of the volume occurs, only the blocks of data on the volume that have changed since the prior backup need to be saved. Such a backup is typically called an incremental backup and results in a file called an incremental backup image, which contains just those blocks that have changed since the prior backup. A first incremental backup, combined with the prior full backup, contain all of the data for the blocks on the volume at the time that the first incremental was created. The first incremental file is therefore only useful if the full image file is also available. In other words, the first incremental file is dependent upon the full image file. Subsequent backups may also be incremental backups, each containing just the data for the blocks that were changed since the prior backup, and each subsequent incremental image file is dependent upon the incremental image file for the prior backup. A full image file, and all of the incremental image files which depend upon it, can be called a “backup image file chain” (or simply a “chain”). Each image file in the chain, combined with the image files upon which it depends, represents the state of the backed up volume's data at a specific point-in-time, and therefore sometimes each image file in the chain is called a “point-in-time” or a “recovery point.” Mechanisms may be used to enable backup systems to actively track the blocks which change between backups, such that the creation of incremental backups does not require a time-consuming comparison of the current state of the volume with a prior state, but rather a list of the changed blocks is available and is used to determine which blocks are to be saved in the incremental image file. This list makes it possible for such periodic incremental backups to be efficient with both time, and storage space.
With contemporary image backup systems, when restoring a backup image file, that image file, and all of the image files upon which it depends, must be utilized (read from) in order to restore the target volume to a state where it contains the same set of blocks as were contained by the source volume at the time the backup was created. In scenarios where the backup contains many billions or even trillions of bytes of data, restoring all of that data to the target volume can take an inordinate amount of time, potentially many days.
If the source volume of a backup image chain contains a recognized file system, then typically when a point-in-time in an image chain is restored to a target volume, the operating system will recognize the data on the target volume as being a recognized file system, and will mount that file system. The mounting is the exposure of the file system's file contents for use by applications and users. When the operating system mounts a file system, the process of mounting alone can results in changes being written to the volume, such as writes to various logs and file system journaling structures on the volume. Also, once mounted by the operating system, the volume's contents can be easily altered by users and applications. When any such writes occur, the state of the target volume's data no longer exactly reflects the state of the point-in-time data that was restored to the target volume. If the image chain, which was utilized for the restore, contained points-in-time that are chronologically younger or newer than the point-in-time which was actually restored, then it is not typically possible to subsequently restore a younger point-in-time in the same chain to the same target restore volume, after it has been mounted by the operating system and altered by writes, without losing the data for those writes which occurred after the mount. In other words, when a point-in-time has been restored to a target volume and the target volume has been mounted by the operating system, typically it is not possible to then restore additional points-in-time from the same chain to that restore target volume without the risk of losing data.
If a volume fails, ideally the data for that volume's backup's most-recent point-in-time is made available with minimal downtime between the volume failure and the exposure of the last point-in-time's data for use by users and applications. Because new point-in-time backups are generally being created for the source volume up until the time that the source volume becomes unavailable (due to failure, etc.), to restore the latest point-in-time backup conventional image backup systems cannot even begin their restore operation until the source volume has actually become unavailable, at which point they can choose the most-recent point-in-time as only then is it determined to actually be the most-recent point-in-time. Conventional image backup systems are not able to restore, in advance, some of the points-in-time, while the source volume may still be available, and then later as a separate restore operation (perhaps separated in time by a system reboot). Subsequent points-in-time in the same chain cannot be restored without risking corruption in between such operations due to the exposure, by the operating system, of the volume's file system content.
This limitation has made it difficult to preemptively prepare a standby restore volume with the contents of the last point-in-time that was backed up from the source volume prior to the source volume becoming inaccessible (due to failure or other events).
When a mission-critical computer system has failed, the costs of downtime while a restore is being performed with conventional backup systems to a replacement system can be immense. Even one hour of lost sales due to a failed web server can result in substantial losses. With terabytes of data, the downtime can extend to days.
FIG. 1 illustrates acomputing system100 for practicing embodiments of the present invention. Thecomputing system100 may be a user-type computer, a file server, a compute server, or other similar computer including data storage capability. Computer, computing system, and server are used interchangeably herein to indicate a system for practicing embodiments of the present invention. Thecomputing system100 is configured for executing software programs containing computing instructions and includes one ormore processors110,memory120, one ormore communication elements150,operational storage130, andbackup storage140.
The one ormore processors110 may be configured for executing a wide variety of operating systems and applications including the computing instructions for carrying out embodiments of the present invention.
Thememory120 may be used to hold computing instructions, data, and other information for performing a wide variety of tasks including performing embodiments of the present invention. By way of example and not limitation, the memory may include Synchronous Random Access Memory (SRAM), Dynamic RAM (DRAM), Read-Only Memory (ROM), Flash memory, and the like.
Thecommunication elements150 may be configured for communicating with other devices or communication networks. By way of example, and not limitation, thecommunication elements150 may include elements for communicating on wired and wireless communication media, such as, for example, serial ports, parallel ports, Ethernet connections, universal serial bus (USB) connections, IEEE 1394 (“firewire”) connections, BLUETOOTH® wireless connections, 802.1 a/b/g/n type wireless connections, and other suitable communication interfaces and protocols.
Theoperational storage130 andbackup storage140 may be used for storing large amounts of non-volatile information for use in thecomputing system100. Theoperational storage130 andbackup storage140 may be configured as one or more storage devices. By way of example and not limitation, these storage devices may include computer-readable media (CRM). This CRM may include, but is not limited to, magnetic and optical storage devices such as disk drives, magnetic tapes, CDs (compact disks), DVDs (digital versatile discs or digital video discs), and other equivalent storage devices. Furthermore, the storage devices may include Redundant Array of Independent Disks (RAIDS) storage systems.
While illustrated as two discrete blocks inFIG. 1, those of ordinary skill in the art will recognize that theoperational storage130 andbackup storage140 may actually be resident on the same mass storage device such that a portion of the mass storage device contains operational information for storing software and data used by thecomputing system100 and a portion of the mass storage device includes backup information. Furthermore, theoperational storage130 may span multiple devices and thebackup storage140 may span multiple devices.
Software processes illustrated herein are intended to illustrate representative processes that may be performed by the systems illustrated herein. Unless specified otherwise, the order in which the process acts are described is not intended to be construed as a limitation, and acts described as occurring sequentially may occur in a different sequence, or in one or more parallel process streams. It will be appreciated by those of ordinary skill in the art that many steps and processes may occur in addition to those outlined in the flow charts. Furthermore, the processes may be implemented in any suitable hardware, software, firmware, or combinations thereof.
When executed as firmware or software, the instructions for performing the processes may be stored on a computer-readable medium. A computer-readable medium includes, but is not limited to, magnetic and optical storage devices such as disk drives, magnetic tape, CDs (compact disks), DVDs (digital versatile discs or digital video discs), and semiconductor devices such as RAM, DRAM, ROM, EPROM, and Flash memory.
By way of non-limiting example, computing instructions for performing the processes may be stored on theoperational storage130, transferred to thememory120 for execution, and executed by theprocessor110. Theprocessor110, when executing computing instructions configured for performing the processes, constitutes structure for performing the processes. In addition, some or all portions of the processes may be performed by hardware specifically configured for carrying out the processes.
While some of the discussion herein may be directed toward MICROSOFT® operating systems such as WINDOWS NT® and WINDOWS XP®, the present invention is not so limited and is applicable to many operating systems. In addition, the discussion may relate to managing file and directory structures on storage volumes. Generally, file movements are referred to as read accesses and write accesses. It will be understood by those of ordinary skill in the art that other file accesses are possible within the scope of the invention, such as, for example, changing dates on a file, changing privileges on a file, and changing other file status indicators. Furthermore, the discussion generally refers to files. However, unless specified otherwise, references to files may include files, directories, and other similar data structures associated with mass storage devices. A volume may include a variety of mass storage devices, such as, by way of example and not limitation, hard disk drives, flash drives, floppy disks, digital cameras, music players, and similar devices.
FIG. 2 illustrates partitioning of atarget volume160 and a shadowedincremental backup170 of portions of thetarget volume160. In the embodiment ofFIG. 2, thetarget volume160 is partitioned as a physical (i.e., sector) layout of the disk or volume with sectors160-1 through160-n.The shaded sectors illustrate sectors that include information stored thereon at a given point in time. The shadowedincremental backup170 is a storage area that tracks (i.e., shadows) and includes changes in the information stored on thetarget volume160 after the given point in time. Thus, at time t0, thetarget volume160 includes data at sectors160-1,160-3 through160-6, and160-n-2. Also at time t0, the shadowedincremental backup170 does not include any information.
As time progresses after time t0, changes may occur to specific sectors on thetarget volume160. With incremental sector-type backup systems, when information within a sector is modified, theincremental backup170 is written to include the same information as a corresponding sector on thetarget volume160. As an example, if information within a file at sector160-4 is modified, the incremental backup process creates acopy162 of the entire sector as structure170-4 on theincremental backup170 including the change to the information within the file in question. If additional changes are made to information within sector160-4, the same changes are made to structure170-4 on theincremental backup170.
As another example, perhaps a new file is created and placed in sector160-n-2. The incremental backup process will copy166 the contents of the160-n-2 sector to structure170-n-2 of theincremental backup170. Any subsequent changes to information in sector160-n-2 will also be made to structure170-n-2 of theincremental backup170.
As another example, perhaps information is added to previously empty segment160-7 on thetarget volume160. The incremental backup process will add164 the same information to structure170-7 of theincremental backup170. Any subsequent changes to information in sector160-7 will also be made to structure170-7 of theincremental backup170.
For ease of discussion, theincremental backup170 is illustrated as structures (e.g.,170-4,170-7,170-n-2) of theincremental backup170 corresponding to the sectors (e.g.,160-4,160-7,160-n-2) of thetarget volume160. However, while the structures of theincremental backup170 will have the same data size as the sectors of thetarget volume160, the structures do not need to be configured on sector boundaries, be contiguous, or be in the same relative location as thetarget volume160. Thus, a structure, such as, for example, structure170-4 may include information at various locations within theincremental backup170 and may be stored at any location within theincremental backup170. In other words,FIG. 2 illustrates a physical arrangement of thetarget volume160, but only a logical arrangement of theincremental backup170.
With this sector backup process, sectors may contain a variety of information, such as, for example, files, portions of files, and metadata such as directory structures and file pointers. In addition, it does not matter how many times information in a sector changes, only the last result at the time that an incremental backup is created and is stored. Thus, if incremental backups use a shadowing process, the shadow copy keeps a coherent image of the changes that are made to the volume as the changes are made. Then, to create an incremental backup, a time point is defined. The shadow copy is saved as the incremental backup at that time point and a new shadow copy is created showing no changes, but ready to be modified with changes that occur after the time point.
FIG. 3 is a flow diagram200 illustrating backup procedures for creating synthetic base backups of a target volume. First, abase image210 is created at time t0. This base image is a backup of the entire volume. As stated earlier, abase image210 may be very large depending on the size of the volume being backed up and the amount of information on the volume at the time thebase image210 is made. As a result, a base image may take a relatively long time to create and consume considerable storage space.
After thebase image210 has been created, incremental images may be created. As shown inFIG. 2, an incremental image includes changes to a volume that occur after a specific point in time. Thus,incremental backup1 includes all sectors that have changes after time t0 and up to the time t1 ofincremental backup1.
Similarly,incremental backup2 includes all changes made to sectors of the volume after the point in time thatincremental backup1 was created and up to the time t2 ofincremental backup2. The process of creating incremental backups may be repeated indefinitely, up to incremental backup N.
FIG. 3 illustrates additionalincremental backups3,4,5, and up to N in abackup procedure200. These incremental backups are sector based. As a result, to restore any given file, or the entire volume, a restoration process must take into account the most recent incremental backup, thebase image210, and all intervening incremental backups. Of course, a sector may have changed for each incremental backup. Thus, only the most recent backup includes the most recent information for that sector. For example, to get a true picture of the volume at time t4, the restoration process would take into account: thebase image210; changes made between time t1 and t0 and reflected inincremental backup1; changes made between time t2 and t1 and reflected inincremental backup2; changes made between time t2 and t3 and reflected inincremental backup3; and changes made between time t4 and t3 and reflected inincremental backup4.
Consequently, one of the problems with creating long backup chains is that thebase image210 must be combined with a large number of intervening incremental backups to get a true picture of the volume. As a non-limiting example, suppose a base-image is created on a first day. Then each subsequent day, two incremental backups are made at different times. After a couple of weeks, there would be28 or more incremental backups that would need to be combined with thebase image210 to create a full picture of the target volume after those 14 days.
On the other hand, because of the large size of a base image and the relatively long time to create a base image, creating base images more often also may be difficult. In addition, for security and redundancy reasons, many organizations maintain a copy of backups at remote sites. With remote site storage, the target volume can still be recovered if the base image or the incremental backup images are removed from the local site. For example, backup information at local sites may be removed periodically to allow for more storage space at the local site. In addition, a hardware or software failure may corrupt data on the local volume or otherwise prevent access to the local data. In these cases, the backup information can be retrieved from the remote site to restore the state of the local volume.
Thus, a balance must be struck between the time, storage, and communication issues associated with a base image relative to the processing time associated with long incremental backup chains.
Embodiments of the present invention can reduce the length of the incremental backup chain by creating synthetic base images. As a non-limiting example, inFIG. 3synthetic base image3 may be created by combining all the information frombase image210,incremental backup1,incremental backup2, andincremental backup3. Thus,synthetic base image3 becomes a complete base image at time t3. Now, to get a true picture of the volume at time t4, the restoration process could usesynthetic base image3 as a starting point rather than thebase image210. As a result, a true picture of the target volume at time t4 may be obtained by combiningsynthetic base image3, with the changes made between time t4 and t3 that are reflected inincremental backup4.
Similar tosynthetic base image3,synthetic base image5 may be created by combining thebase image210 with all of the incremental backups1-5. As a result,synthetic base image5 represents a base image (e.g., a full backup) at time t5. While not illustrated inFIG. 3, those of ordinary skill in the art will recognize thatsynthetic base image5 also may be created by combining the information fromsynthetic base image3 withincremental backup4 andincremental backup5. In the end,synthetic base image5 should contain the same information through either combination process.
FIG. 4 is a flow diagram illustratingbackup procedures220 for creating synthetic incremental backups (also referred to as differential backups) of a target volume. Another way to reduce the length of chains of incremental backups is to create synthetic incremental backups at intermediate stages in the chain. As withFIG. 3,FIG. 4 includesbase image210, incremental backups1-6, and up to incremental backup N at times t0 through tN. In a manner similar to the process for creating a synthetic base image, a synthetic incremental backup may be created by combining one or more contiguous incremental backups.
As a non-limiting example, synthetic incremental backup1-3 may be created by reflecting all the changes made betweenincremental backup1 andincremental backup3. Thus,incremental backup1,incremental backup2, andincremental backup3 are combined to create synthetic incremental backup1-3. Similarly, synthetic incremental backup4-5 may be created by combining the information fromincremental backup4 andincremental backup5.
Many combinations are possible for creating synthetic incremental backups. Furthermore, synthetic incremental backups may be used to create other synthetic incremental backups. As non-limiting examples, synthetic incremental backup1-5 may be created by combining the information from synthetic incremental backup1-3 and synthetic incremental backup4-5.
As additional non-limiting examples, synthetic incremental backups may be combined with individual incremental backups. InFIG. 4, synthetic incremental backup4-6 may be created by combining the information from synthetic incremental backup4-5 andincremental backup6. Thus, synthetic incremental backup4-6 includes all the information on changes made to the volume between times t4 and t6. Similarly, synthetic incremental backup1-6 may be created by combining the information from synthetic incremental backup1-5 andincremental backup6.
With synthetic incremental backups created, the backup and restoration process may be configured to delete intermediate incremental backups or may keep the intermediate incremental backups. For example, with synthetic incremental backup1-3 created,incremental backups1 through3 could be deleted. However, if they are deleted, some time granularity may be lost. In other words, if information is to be restored, information could be obtained at time t3 or at time t0, but information at times t1 and t2 is no longer available.
On the other hand, if the original incremental backups are retained along with the synthetic incremental backups, full time granularity can be maintained, and the incremental chain used to restore information at a point in time can also be reduced.
As a non-limiting example, if information is to be restored for time t6, a number of optional chains are available based on the synthetic incremental backups created inFIG. 4. One chain could be usingbase image210 in combination with incremental backups1-6. Another chain could usebase image210 in combination with synthetic incremental backup1-3 and incremental backups4-6. Another chain could usebase image210 and the combination of synthetic incremental backup1-3, synthetic incremental backup4-5, andincremental backup6. Still another chain could usebase image210 in combination with synthetic incremental backup1-5 andincremental backups6. Those of ordinary skill in the art will recognize that many other combinations of incremental backups and synthetic incremental backups may be used.
FIG. 5 is a flow diagram illustratingbackup procedures240 for creating synthetic incremental backups and synthetic base backups of a target volume. The process of creating synthetic incremental backups may be combined with the process to create synthetic base images to reduce processing time and allow for more granularity in accessing base images. As withFIGS. 3 and 4,FIG. 5 includesbase image210, incremental backups1-6, and up to incremental backup N at time t0 through tN. As explained earlier, synthetic incremental backups1-3 and4-5 may be created. As a non-limiting example,synthetic base image3 may be created by combining the information frombase image210 and synthetic incremental backup1-3. As another non-limiting example,synthetic base image5A may be created by combining the information frombase image210, synthetic incremental backup1-3,incremental backup4, andincremental backup5. As another option for a base image at time t5, synthetic base image5B may be created by combining information fromsynthetic base image3 and synthetic incremental backup4-5. Those of ordinary skill in the art will recognize that many other combinations of incremental backups, synthetic incremental backups, and synthetic base images may be used.
FIG. 6 is a simplified block diagram of a file control stack for an operating system. Software organization of a computing system is generally divided between a user mode300 and akernel mode350. The user mode300 includes many operating system310 (OS) functions anduser applications320 configured for operation with theoperating system310. TheOS310 may include operating system processes312 and Application Program Interfaces (APIs)314 for providing a general interface betweenuser applications320 and lower level kernel software modules.
Thekernel mode350 includes lower level operating system functions, such as acomputing file system360,logical volume control370 and device drivers for the various hardware devices in a system. Theoperating system310 anduser applications320 may make file Input/Output (I/O) requests in the form of read accesses and write accesses (also referred to as access requests) to thecomputing file system360.
Thecomputing file system360 passes these access requests on to thelogical volume control370 to determine what disk(s)380, and what sector(s) on the disk(s)380 should be accessed to perform the access request. Local disk requests are made to aport module385 that converts the disk request into the necessary hardware controls to retrieve or store the desired data. Of course, as stated earlier, adisk380 may include multiple logical volumes on different partitions of the disk.
Thedisks380 may be a variety of devices, such as magnetic disks, optical disks, solid state disks, Universal Serial Bus (USB) drives, and the like.
FIG. 7 is a simplified diagram of an example of one type ofdisk380 and control structures thereon. Thedisk380 includes amaster boot record410 and one or more partitions, such as, for example,partition420A,420B, andunused sector382. The master boot record (MBR)410 may include bootstrap code, a partitioning table, and other information related to partitioning of the disk into one or more partitions, volumes, or combinations thereof.Volumes420A and420B may be referred to generically herein as420. For some operating systems, and disk structures, the MBR may be between 1 and 63 sectors on thedisk380 and generally starts atsector0 of thedisk380. When a computer boots, the OS examines theMBR410, which includes a jump instruction to another portion of thedisk380 for performing the boot process. TheMBR410 also includes disk layout and partition information so the OS can determine size and location of logical volumes on the disk.
Eachvolume420 includes avolume boot record422 and adata segment450. Thevolume boot record422 of avolume420 may include a jump instruction to code that may be used in determining information about thevolume420. Thevolume boot record422 of avolume420 may also include a Basic Input Output System (BIOS) parameter block to give details about how to perform input/output operations for thevolume420. Thevolume boot record422 also gives information about thevolume420 such as size and file system type, such as, for example, Linux, FAT16, FAT12, FAT32, NTFS, and the like. When an OS recognizes valid information in thevolume boot record422 the OS can mount thevolume420, which makes thedata segment450 available to IO accesses from other OS processes and user applications.
If thevolume420 is anactive boot volume420A, after processing theMBR410, the OS jumps to the beginning of thevolume boot record422 of theboot volume420A. The OS uses the jump instruction at the beginning of thevolume boot record422 of theboot volume420A to jump to a process for completing the boot process and loading of the OS. As part of the process, the OS uses theMBR410 to finddata volumes420B, determine if they are valid, and mount them if they are valid.
Some embodiments of the present invention may invalidate thevolume boot record422 of avolume420 and create aprivate structure425 for thevolume boot record422. With an invalidvolume boot record422, the OS may still know that thevolume420 is present (using the information in the MBR410), but it cannot get information about the file system on thevolume420 and will therefore not mount thevolume420. With thevolume420 unmounted, no file accesses to thevolume420 can be made by the standard file system of the OS. As a result, no writes can be made to thevolume420 under normal file access control and writes to thevolume420 may be limited to embodiments of the present invention.
FIG. 8 is a simplified diagram of a volume420 (also referred to herein as a restore volume) with an invalidatedvolume boot record422 for the private structure425 (FIG. 7). Thevolume420 is divided up into a number ofsectors440. For simplicity and clarity in the drawings, only some of the sectors are shown and the others are implied as filling in the blank spaces of thevolume420. As shown in the drawings, thevolume boot record422 is illustrated assector0 of thevolume420. For most conventional operating systems the volume boot record is defined assector0. However, embodiments of the present invention can include thevolume boot record422 as any sector or sectors on thevolume420.
Thevolume boot record422 is shown in an exploded view showingvarious bytes424 of thevolume boot record422 to show details of thevolume boot record422. Aboot segment430 of thevolume boot record422 includes the information necessary for the OS to recognize thevolume420 as including a valid file system. InFIG. 8, thisboot segment424 is set to all zeros, which invalidates thevolume420 from the perspective of the OS. In other words, the OS may view thevolume420 as an unformatted volume. Thevolume boot record422 may include other information useful to the OS, which may or may not be set to zeros, as indicated by the “X” in some of the bytes.
While zeros in theboot segment430 cause thevolume420 to be unrecognizable to the standard file system of the OS, thevolume boot record422 also includes a special boot identifier (SBI)435. Thisspecial boot identifier435 can be read by a private block access in a special OS or a special file system within the standard OS as is explained more fully below with reference toFIG. 10. This special access path is generally referred to herein as a private block access. However, other special systems, not necessarily performing a restore operation, may be defined to use thespecial boot identifier435 as described herein.
The private block access can read thevolume boot record422 and even though it sees zeros in thevolume boot record422, it can look at another area of thevolume boot record422 in an attempt to find thespecial boot identifier435. In the example shown inFIG. 8, thespecial boot identifier435 is shown in the last few bytes of thevolume boot record422. Of course, thespecial boot identifier435 may be different sizes and at different locations with thevolume boot record422 as long as the private block access knows where to find it within thevolume boot record422. The private block access then knows that the balance of the restorevolume420 is available for a restore process as explained below with reference toFIGS. 11 and 12.
Also shown inFIG. 8 is a restore information block460. The combination of thespecial boot identifier435 and the restore information block460 comprise a set of restore metadata that may be used by a restore process. The restore metadata may include a variety of information, such as, for example, information related to a restore process for thevolume420.
As a non-limiting example, the restore metadata may include: 1) thespecial boot identifier435; 2) a volume identifier; 3) checkpoint information; and 3) a validation block. Thespecial boot identifier435 has been discussed above.
The volume identifier may be included to indicate for which volume the restore metadata is being used. In volumes where the restore metadata is stored on the volume, the volume identifier is implicit. However, the restore metadata may be stored in a file on another volume, with only a pointer to the restore metadata existing in the special boot identifier. In the case of a file, the volume identifier explicitly identifies which volume the restore metadata is for.
The checkpoint information may include various information about where in the restore process the current restore volume is, as will become more apparent from the discussion of the restore process with reference toFIG. 12. Non-limiting examples of checkpoint information include: a most recently applied image file (e.g., point in time, such as base image or incremental backup image), progress within the current image file, time stamps including data and time, bytes within the current image applied so far, and total bytes within the current image.
The validation block may include information to verify that the restore metadata has not been corrupted using, for example, a checksum or a hash function. A hash function provides a small digital “fingerprint” from any kind of data input. By chopping and mixing the original data, the hash algorithm can create a small hash value that is unique to the data run through the hash algorithm. The resulting fingerprint is often called a “hash value.” One property of a hash value is that if two hash values generated with the same hash algorithm are different, then the two inputs are different in some way. As a result, a hash function can be used as a checksum such that the relatively small hash value can be used to verify that a data file of any size has not been altered.
FIG. 9 is a simplified diagram of a restorevolume420 defined with asmall file structure437 thereon. In some embodiments, rather than invalidating the restorevolume420, the restorevolume420 may be defined as a small file structure (e.g., FAT12) using the proper information for thesmall file structure437 as represented inFIG. 9 by the Small File Identifier (SFID). Thevolume boot record422 is shown in an exploded view showingvarious bytes424 of thevolume boot record422 to show details of thevolume boot record422. Aboot segment430 of thevolume boot record422 includes the information necessary for the OS to recognize the restorevolume420 as a valid volume.
InFIG. 9, thisboot segment430 is set to represent thesmall file structure437. Thus, when the OS examines thevolume420, it will determine that the volume is small and, as a result thesectors440 of thevolume420 not occupied by thesmall file structure437 will be inaccessible by the OS and other applications.
However, as with the embodiments ofFIG. 8, the volume boot record422 (or a file within the small file structure) also includes a special boot identifier (SBI)435. Thisspecial boot identifier435 can be read by a private block access in a special OS or a special file system within the standard OS as is explained more fully below with reference toFIG. 10. This special access path is generally referred to herein as a private block access. However, other special systems, not necessarily performing a restore operation, may be defined to use thespecial boot identifier435 as described herein.
The private block access can read thevolume boot record422 and even though it sees asmall file structure437, it can look at another area of thevolume boot record422, or a file on the small file structure, in an attempt to find thespecial boot identifier435. In the example shown inFIG. 9, thespecial boot identifier435 is shown in the last few bytes of thevolume boot record422 and as a file at the end of the small file structure. Of course, thespecial boot identifier435 may be different sizes and at different locations as long as the private block access knows where to find it. The private block access then knows that the balance of the restorevolume420 not taken up by thesmall file structure437 is available for a restore process as explained below with reference toFIGS. 11 and 12.
Also shown inFIG. 9 is a restore information block460. The combination of thespecial boot identifier435 and the restore information block460 comprise a set of restore metadata that may be used by a restore process. The restore metadata may include a variety of information, as explained above with reference toFIG. 8.
FIG. 10 is a simplified block diagram of a software stack for accessing block media according to embodiments of the present invention.FIG. 10 illustrates, from a process perspective, block level accesses to blockmedia399.Block media399 are generally accessed in block level units. The operating system and applications running on the operating system may perform block media accesses as illustrated byblock321.
Access to blockmedia399 may be in the form of public accesses, which are generally exposed to applications through software processes within the operating system, file system, or combinations thereof. Accesses to block media may also be through private channels set up by specialized software processes that are not exposed to other applications. These private and public access channels may also be referred to as IO control interfaces (IOCTLs).
The private channels may be set up as a software process operating at an application level to perform direct block access in a manner that is private from other applications by a privateblock access process322. The private accesses may also be set up using a software process operating as ablock device filter390 below thefile system360.
Due to these many channels, there may be many ways to access theblock media399. Each of the channels shown inFIG. 10 is discussed below. Operations to blockmedia399 are generally referred to as accesses and may be understood to include both read and write operations. From a file centric perspective, file accesses394 are generally considered public. Thefile system360 may convert apublic file access394 to one or more public block accesses392C to access theblock media399 and collect the necessary information to assemble the contents of a file, which is then passed back to the application making the access request.
In some embodiments of the invention, theblock device filter390 may reside below thefile system360 and filter out, or pass through, the public block accesses392C from thefile system360. If apublic block access392C is allowed to pass through it is presented to theblock media399 as apublic block access396A.
From a block access perspective, an application may directly access block media399 (in block form rather than file form) through software processes exposed by the operating system as indicated by public block access (392A and392B). An application may also set up its own privateblock access process322 to performprivate block access392D and392E.
In embodiments that include ablock device filter390, the public block accesses392B may be blocked, or may be passed on as public block accesses396A. Similarly, private block accesses392D may be blocked, or may be passed on as private block accesses396B. In describing embodiments that use theblock device filter390, public block accesses392B are performed, rather than public block accesses392A, because theblock device filter390 can filter all accesses to theblock media399 and public block accesses392A would not be available. Similarly, private block accesses392D are performed, rather than private block accesses392E, because theblock device filter390 can filter all accesses to theblock media399 and private block accesses392E would not be available.
In embodiments that do not use ablock device filter390, public block accesses392A and private block accesses392E are available, whereas public block accesses392B and private block accesses392D are not available. Public file accesses394 will generate block accesses392C directly (not shown) to theblock media399.
In embodiments that invalidate a volume boot record, as discussed above, ablock device filter390 is not needed, however, some embodiments may use both invalidation of the volume boot record and theblock device filter390 to prevent undesired accesses to theblock media399. Thus, when using an invalidated boot record, a restore process may prevent a restore volume, or a portion thereof, from being recognized by a standard file system by inserting a restore structure in at least a portion of the volume boot record of the restore volume using a block access (392A or392E), which may be through apublic channel392A or aprivate channel392E. The restore structure may be used by the restore process to hold various metadata related to the restore process and at the same time causes the restore volume to be unrecognizable by thefile system360.
After the restore structure is present in the volume boot record of the restore volume, the restore process may use public block accesses392A or private block accesses392E to write restore data to restore volume. Normal public file accesses394 are prevented to the restore volume because the volume boot record is unrecognizable so the restore volume cannot be mounted. After all the restore data is written to the restore volume, the block accesses (392A or392E) are used to the restore volume to finalize it by restoring the volume boot record to a state indicating that the restore volume is valid.
In embodiments that use ablock device filter390, the volume boot record may not need to be invalidated, however, some embodiments use both invalidation of the volume boot record and theblock device filter390 to prevent undesired accesses to theblock media399. Theblock device filter390 prevents public block accesses392B from applications and public block accesses392C from reaching theblock media399 as public block accesses396A. Only private block accesses392D (passed through theblock device filter390 as private block accesses396B) and private block accesses392E are allowed to access theblock media399. Thus, theblock media399 is effectively prevented from being modified, except through theblock device filter390 orprivate block control322, which are used by the restore process.
To perform the restore process, the restore volume, or a portion thereof, is prevented from undesired changes by theblock device filter390 preventing public media accesses (e.g., public file accesses394, public block accesses392B and public block accesses392C) to the restore volume and allow private block accesses (392D and392E) to the restore volume. The restore process may use private block accesses (392D and392E) to write restore data to the restore volume while the public media accesses (394,392B, and392C) are being prevented. After all the restore data is written to the restore volume, theblock device filter390 may be removed or reconfigured to allow public media accesses (394,392B, and392C) to finalize the restore volume.
FIG. 11 is a simplified block diagram of a backup and restore system. The system illustrates abackup computer480 withoperational storage482 andbackup storage484 controlled by one ormore processors486,memory488, and communication elements489. A restorecomputer490 includesoperational storage492 and restorestorage494 controlled by one ormore processors496,memory498, andcommunication elements499. In embodiments of the disclosure thebackup computer480 and the restorecomputer490 may be physically near each other or may be in vastly different physical locations. In addition, in some embodiments, thebackup computer480 and restorecomputer490 may be the same computer withbackup storage484,operational storage482, and restorestorage494. In such embodiments, it will be understood that the processes described herein as occurring on two different computers may all occur on the same computer.
Backup images may be communicated between thebackup computer480 and the restorecomputer490 via acommunication link460. This communication may be accomplished in a number of ways. As non-limiting examples, the backup images may be communicated electronically across a wide-area network (WAN) or a local-area network (LAN).
As stated earlier, base images may be quite large. Therefore, as other non-limiting communication examples, the base image, incremental images, and combinations thereof may be placed on media such as a magnetic media or an optical media and physically transported by any appropriate transportation element from the local site to the remote site. Furthermore, prior to communication (either electronically or by transport), the base image may be compressed to reduce the size of the image or encrypted for security.
As mentioned earlier performing a restore process when a computer has failed may be an extremely long process when terabytes of data are involved. In addition, when migrating data from one computer to another (e.g., in an upgrade) it may be difficult to have both the old computer and the upgrade computer down for the entire time required to perform a restore process.
In conventional restore systems, a final backup would need to be made on the old computer and then access to that old computer prevented so that the new computer can be restored to a same and coherent state with the old computer. Thus, conventionally, restore processes have been performed over night or on weekends to minimize the impact of down time on the users. For example, a server with terabytes of data may take 18 to 24 hours (or even longer) to restore if the restore process can restore 1 GB per minute. In addition, restore processes are generally not completely automated and must be monitored by a system administrator. If any problems with the restore occur, the entire restore process may need to be started from the beginning, creating even more down time.
Embodiments of the present invention use the invalidated volumes discussed above to perform a headstart restore process, wherein a restore may begin on a new machine while the old machine is still operating in its normal mode performing its usual functions and incremental backup functions as explained above.
By marking a volume as invalid to the operating system, the OS and user applications cannot modify that volume. However, a private block access operating separate from or in combination with the OS can recognize the invalid volume and begin restoring backup files from the old computer on the new computer while the old computer is still running.
The restore process can continue with incremental backups until the new computer is restored to the same state as the old computer was at the last incremental backup. Then, a final incremental backup can be performed on the old computer and the old computer is shut down to prevent any further changes. Then, only the final incremental backup needs to be restored on the new computer and the invalidated volume can be finalized and exposed to the operating system.
FIG. 12 is simplified flow diagram of a process for restoring data including abackup process500 and a restoreprocess600. As mentioned earlier, the backup process may be performed on a backup computer480 (FIG. 11) and the restore process may be performed on a restorecomputer490. Alternatively, the backup process and restore process may be performed on a single computer. In addition, both thebackup process500 and the restoreprocess600, except where noted, may be performed in the background and concurrently, such that the respective computers can continue on with their normal operations.
Inoperation502 of thebackup process500, a base image of the volume to be backed up is created.Decision block504 determines if a restore process should be started on this or another computer. If so,operation506 sends507 any backup files that have not yet been sent to the computer performing the restoreprocess600. If it is the same computer, the sending may simply be pointers to the backup files. If no incremental images have been created, only the base image would be sent. On subsequent cycles through the loop ofbackup process500 incremental images may be sent from thebackup process500 to the restoreprocess600 as they become available.
Decision block508 tests to see if it is time to create a new incremental backup at a new point in time. If not, thebackup process500 loops back todecision block504. If it is time for a new incremental backup,operation510 performs the incremental backup.
Operation block512 creates a synthetic incremental backup if desired and as discussed above with reference toFIGS. 2-5.Operation block514 creates a synthetic base backup if desired and as discussed above with reference toFIGS. 2-5.
Decision block516 tests to see if completion of a restore process is desired. In the case of a migration, this test may occur at any user-defined time based on when it may be convenient to have both the backup computer and restore computer unavailable while a final incremental backup is restored. In the case of a catastrophic failure, that failure would prompt the decision to complete the restore process.
If a completion of the restore process is not desired, thebackup process500 loops back todecision block504. If a completion of the restore process is desired,operation518 sends520 the final backup to the restoreprocess600. Of course, in the case of a catastrophic failure, the final backup may not be made and the last completed incremental would have to be used as the restore point. After a final incremental is sent, the backup process ends.
The restoreprocess600 may begin concurrently with thebackup process500 any time after the base image has been received by the restoreprocess600. To begin,operation602 invalidates the volume to be restored as discussed above with reference toFIGS. 7-10. As stated above, since the volume is inaccessible and unrecognized by the standard file system, the volume may be a boot volume or a data volume.Operation604 copies the available backup images that have been sent from thebackup process500 to the invalidated volume. Decision block606 tests to see if it is time to complete the restoreprocess600. As stated above, this may be a user selected time or a forced time due to a catastrophic failure. If the restore is not to be completed at this time, the restoreprocess600 loops tooperation604 to restore more backup images if they are available. If the restore is to be completed,operation608 finalizes the volume to make it recognizable by the standard file system.
Optionally,operation610 may mount the restored volume so that the standard file system can access it. Optionally,operation612 may reboot the system if the restored volume is a boot volume. Upon rebooting, the MBR may point to a now valid volume boot record on the newly restored boot volume so the system can boot up from the restored boot volume.
Alternatively,operations608,610, and612 may be postponed. As a non-limiting example, the restore system may have multiple volumes that need to be restored. It may be desirable to wait until all the volumes have been restored before exposing the restored volumes to the standard file system, mounting the volumes, or a combination thereof.
In many modern systems a disk, a volume, or a combination thereof may be virtually represented by a file that appears to the system (through APIs or other means) as a conventional physical volume. Embodiments of the present invention may be used on such virtual volumes as explained above. In addition, the file that emulates the virtual volume may be write protected or hidden from the standard file system as a way to invalidate the virtual volume from the perspective of the standard file system.
FIG. 13 illustrates one non-limiting example for using virtual disks by using them in a virtual machine (VM)host system700. Multiplevirtual machines710 may run as processes on theVM host system700. Through theVM host system700, thesevirtual machines710 may have access to one or more virtual disks720 (and virtual volumes). TheVM host system700 may maintain thesevirtual disks720 as files for theVM host system700, but they would appear as volumes to thevirtual machines710. A backup and restoresystem750 running alongside, or within, the VM host system would be able to read, write, and invalidate thevirtual disks720 to perform restore processes as discussed above simply by locking or hiding the file representing thevirtual disk720.
Of course, the backup and restore processes may also operate as discussed above with referenceFIGS. 7-10 by operating within one of thevirtual machines710 to access thevirtual disks720.
Although this invention has been described with reference to particular embodiments, the invention is not limited to these described embodiments. Rather, the invention is limited only by the appended claims, which include within their scope all equivalent devices or methods that operate according to the principles of the invention as described.