Incryptography, apreimage attack oncryptographic hash functions tries to find amessage that has a specific hash value. A cryptographic hash function should resist attacks on itspreimage (set of possible inputs).
In the context of attack, there are two types of preimage resistance:
These can be compared with acollision resistance, in which it is computationally infeasible to find any two distinct inputsx,x′ that hash to the same output; i.e., such thath(x) =h(x′).[1]
Collision resistance implies second-preimage resistance, but does not guarantee preimage resistance.[1] However, under certain assumptions of the range of the hash function, collision resistance does imply preimage resistance (by a provisional implication)[1]. Conversely, a second-preimage attack implies a collision attack (trivially, since, in addition tox′,x is already known right from the start). Via the provisional implication, a pre-image attack will also imply a second-preimage attack, which then also extends to a collision attack.
By definition, an ideal hash function is such that the fastest way to compute a first or second preimage is through abrute-force attack. For ann-bit hash, this attack has atime complexity2n, which is considered too high for a typical output size ofn = 128 bits. If such complexity is the best that can be achieved by an adversary, then the hash function is considered preimage-resistant. However, there is a general result thatquantum computers perform a structured preimage attack in, which also implies second preimage[2] and thus a collision attack.
Faster preimage attacks can be found bycryptanalysing certain hash functions, and are specific to that function. Some significant preimage attacks have already been discovered, but they are not yet practical. If a practical preimage attack is discovered, it would drastically affect many Internet protocols. In this case, "practical" means that it could be executed by an attacker with a reasonable amount of resources. For example, a preimaging attack that costs trillions of dollars and takes decades to preimage one desired hash value or one message is not practical; one that costs a few thousand dollars and takes a few weeks might be very practical.
All currently known practical or almost-practical attacks[3][4] onMD5 andSHA-1 arecollision attacks.[5] In general, a collision attack is easier to mount than a preimage attack, as it is not restricted by any set value (any two values can be used to collide). The time complexity of a brute-force collision attack, in contrast to the preimage attack, is only.
The computational infeasibility of a first preimage attack on an ideal hash function assumes that the set of possible hash inputs is too large for a brute force search. However if a given hash value is known to have been produced from a set of inputs that is relatively small or is ordered by likelihood in some way, then a brute force search may be effective. Practicality depends on the input set size and the speed or cost of computing the hash function.
A common example is the use of hashes to storepassword validation data for authentication. Rather than store the plaintext of user passwords, an access control system stores a hash of the password. When a user requests access, the password they submit is hashed and compared with the stored value. If the stored validation data is stolen, the thief will only have the hash values, not the passwords. However most users choose passwords in predictable ways and many passwords are short enough that all possible combinations can be tested if fast hashes are used, even if the hash is rated secure against preimage attacks.[6] Special hashes calledkey derivation functions have been created to slow searches.SeePassword cracking. For a method to prevent the testing of short passwords seesalt (cryptography).