Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 1 | The dm-integrity target emulates a block device that has additional |
| 2 | per-sector tags that can be used for storing integrity information. |
| 3 | |
| 4 | A general problem with storing integrity tags with every sector is that |
| 5 | writing the sector and the integrity tag must be atomic - i.e. in case of |
| 6 | crash, either both sector and integrity tag or none of them is written. |
| 7 | |
| 8 | To guarantee write atomicity, the dm-integrity target uses journal, it |
| 9 | writes sector data and integrity tags into a journal, commits the journal |
| 10 | and then copies the data and integrity tags to their respective location. |
| 11 | |
| 12 | The dm-integrity target can be used with the dm-crypt target - in this |
| 13 | situation the dm-crypt target creates the integrity data and passes them |
| 14 | to the dm-integrity target via bio_integrity_payload attached to the bio. |
| 15 | In this mode, the dm-crypt and dm-integrity targets provide authenticated |
| 16 | disk encryption - if the attacker modifies the encrypted device, an I/O |
| 17 | error is returned instead of random data. |
| 18 | |
| 19 | The dm-integrity target can also be used as a standalone target, in this |
| 20 | mode it calculates and verifies the integrity tag internally. In this |
| 21 | mode, the dm-integrity target can be used to detect silent data |
| 22 | corruption on the disk or in the I/O path. |
| 23 | |
| 24 | |
| 25 | When loading the target for the first time, the kernel driver will format |
| 26 | the device. But it will only format the device if the superblock contains |
| 27 | zeroes. If the superblock is neither valid nor zeroed, the dm-integrity |
| 28 | target can't be loaded. |
| 29 | |
| 30 | To use the target for the first time: |
| 31 | 1. overwrite the superblock with zeroes |
| 32 | 2. load the dm-integrity target with one-sector size, the kernel driver |
| 33 | will format the device |
| 34 | 3. unload the dm-integrity target |
| 35 | 4. read the "provided_data_sectors" value from the superblock |
| 36 | 5. load the dm-integrity target with the the target size |
| 37 | "provided_data_sectors" |
| 38 | 6. if you want to use dm-integrity with dm-crypt, load the dm-crypt target |
| 39 | with the size "provided_data_sectors" |
| 40 | |
| 41 | |
| 42 | Target arguments: |
| 43 | |
| 44 | 1. the underlying block device |
| 45 | |
| 46 | 2. the number of reserved sector at the beginning of the device - the |
| 47 | dm-integrity won't read of write these sectors |
| 48 | |
| 49 | 3. the size of the integrity tag (if "-" is used, the size is taken from |
| 50 | the internal-hash algorithm) |
| 51 | |
| 52 | 4. mode: |
| 53 | D - direct writes (without journal) - in this mode, journaling is |
| 54 | not used and data sectors and integrity tags are written |
| 55 | separately. In case of crash, it is possible that the data |
| 56 | and integrity tag doesn't match. |
| 57 | J - journaled writes - data and integrity tags are written to the |
| 58 | journal and atomicity is guaranteed. In case of crash, |
| 59 | either both data and tag or none of them are written. The |
| 60 | journaled mode degrades write throughput twice because the |
| 61 | data have to be written twice. |
Mikulas Patocka | c2bcb2b | 2017-03-17 12:40:51 -0400 | [diff] [blame] | 62 | R - recovery mode - in this mode, journal is not replayed, |
| 63 | checksums are not checked and writes to the device are not |
| 64 | allowed. This mode is useful for data recovery if the |
| 65 | device cannot be activated in any of the other standard |
| 66 | modes. |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 67 | |
| 68 | 5. the number of additional arguments |
| 69 | |
| 70 | Additional arguments: |
| 71 | |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 72 | journal_sectors:number |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 73 | The size of journal, this argument is used only if formatting the |
| 74 | device. If the device is already formatted, the value from the |
| 75 | superblock is used. |
| 76 | |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 77 | interleave_sectors:number |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 78 | The number of interleaved sectors. This values is rounded down to |
| 79 | a power of two. If the device is already formatted, the value from |
| 80 | the superblock is used. |
| 81 | |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 82 | buffer_sectors:number |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 83 | The number of sectors in one buffer. The value is rounded down to |
| 84 | a power of two. |
| 85 | |
| 86 | The tag area is accessed using buffers, the buffer size is |
| 87 | configurable. The large buffer size means that the I/O size will |
| 88 | be larger, but there could be less I/Os issued. |
| 89 | |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 90 | journal_watermark:number |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 91 | The journal watermark in percents. When the size of the journal |
| 92 | exceeds this watermark, the thread that flushes the journal will |
| 93 | be started. |
| 94 | |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 95 | commit_time:number |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 96 | Commit time in milliseconds. When this time passes, the journal is |
| 97 | written. The journal is also written immediatelly if the FLUSH |
| 98 | request is received. |
| 99 | |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 100 | internal_hash:algorithm(:key) (the key is optional) |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 101 | Use internal hash or crc. |
| 102 | When this argument is used, the dm-integrity target won't accept |
| 103 | integrity tags from the upper target, but it will automatically |
| 104 | generate and verify the integrity tags. |
| 105 | |
| 106 | You can use a crc algorithm (such as crc32), then integrity target |
| 107 | will protect the data against accidental corruption. |
| 108 | You can also use a hmac algorithm (for example |
| 109 | "hmac(sha256):0123456789abcdef"), in this mode it will provide |
| 110 | cryptographic authentication of the data without encryption. |
| 111 | |
| 112 | When this argument is not used, the integrity tags are accepted |
| 113 | from an upper layer target, such as dm-crypt. The upper layer |
| 114 | target should check the validity of the integrity tags. |
| 115 | |
Mikulas Patocka | a3fcf72 | 2018-07-03 20:13:33 +0200 | [diff] [blame] | 116 | recalculate |
| 117 | Recalculate the integrity tags automatically. It is only valid |
| 118 | when using internal hash. |
| 119 | |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 120 | journal_crypt:algorithm(:key) (the key is optional) |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 121 | Encrypt the journal using given algorithm to make sure that the |
| 122 | attacker can't read the journal. You can use a block cipher here |
| 123 | (such as "cbc(aes)") or a stream cipher (for example "chacha20", |
| 124 | "salsa20", "ctr(aes)" or "ecb(arc4)"). |
| 125 | |
| 126 | The journal contains history of last writes to the block device, |
| 127 | an attacker reading the journal could see the last sector nubmers |
| 128 | that were written. From the sector numbers, the attacker can infer |
| 129 | the size of files that were written. To protect against this |
| 130 | situation, you can encrypt the journal. |
| 131 | |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 132 | journal_mac:algorithm(:key) (the key is optional) |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 133 | Protect sector numbers in the journal from accidental or malicious |
| 134 | modification. To protect against accidental modification, use a |
| 135 | crc algorithm, to protect against malicious modification, use a |
| 136 | hmac algorithm with a key. |
| 137 | |
| 138 | This option is not needed when using internal-hash because in this |
| 139 | mode, the integrity of journal entries is checked when replaying |
| 140 | the journal. Thus, modified sector number would be detected at |
| 141 | this stage. |
| 142 | |
Mikulas Patocka | 9d609f85 | 2017-04-18 16:51:52 -0400 | [diff] [blame] | 143 | block_size:number |
| 144 | The size of a data block in bytes. The larger the block size the |
| 145 | less overhead there is for per-block integrity metadata. |
| 146 | Supported values are 512, 1024, 2048 and 4096 bytes. If not |
| 147 | specified the default block size is 512 bytes. |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 148 | |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 149 | The journal mode (D/J), buffer_sectors, journal_watermark, commit_time can |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 150 | be changed when reloading the target (load an inactive table and swap the |
| 151 | tables with suspend and resume). The other arguments should not be changed |
| 152 | when reloading the target because the layout of disk data depend on them |
| 153 | and the reloaded target would be non-functional. |
| 154 | |
| 155 | |
| 156 | The layout of the formatted block device: |
| 157 | * reserved sectors (they are not used by this target, they can be used for |
| 158 | storing LUKS metadata or for other purpose), the size of the reserved |
| 159 | area is specified in the target arguments |
| 160 | * superblock (4kiB) |
| 161 | * magic string - identifies that the device was formatted |
| 162 | * version |
| 163 | * log2(interleave sectors) |
| 164 | * integrity tag size |
| 165 | * the number of journal sections |
| 166 | * provided data sectors - the number of sectors that this target |
| 167 | provides (i.e. the size of the device minus the size of all |
| 168 | metadata and padding). The user of this target should not send |
| 169 | bios that access data beyond the "provided data sectors" limit. |
Mikulas Patocka | 56b67a4 | 2017-04-18 16:51:50 -0400 | [diff] [blame] | 170 | * flags - a flag is set if journal_mac is used |
Mikulas Patocka | 7eada90 | 2017-01-04 20:23:53 +0100 | [diff] [blame] | 171 | * journal |
| 172 | The journal is divided into sections, each section contains: |
| 173 | * metadata area (4kiB), it contains journal entries |
| 174 | every journal entry contains: |
| 175 | * logical sector (specifies where the data and tag should |
| 176 | be written) |
| 177 | * last 8 bytes of data |
| 178 | * integrity tag (the size is specified in the superblock) |
| 179 | every metadata sector ends with |
| 180 | * mac (8-bytes), all the macs in 8 metadata sectors form a |
| 181 | 64-byte value. It is used to store hmac of sector |
| 182 | numbers in the journal section, to protect against a |
| 183 | possibility that the attacker tampers with sector |
| 184 | numbers in the journal. |
| 185 | * commit id |
| 186 | * data area (the size is variable; it depends on how many journal |
| 187 | entries fit into the metadata area) |
| 188 | every sector in the data area contains: |
| 189 | * data (504 bytes of data, the last 8 bytes are stored in |
| 190 | the journal entry) |
| 191 | * commit id |
| 192 | To test if the whole journal section was written correctly, every |
| 193 | 512-byte sector of the journal ends with 8-byte commit id. If the |
| 194 | commit id matches on all sectors in a journal section, then it is |
| 195 | assumed that the section was written correctly. If the commit id |
| 196 | doesn't match, the section was written partially and it should not |
| 197 | be replayed. |
| 198 | * one or more runs of interleaved tags and data. Each run contains: |
| 199 | * tag area - it contains integrity tags. There is one tag for each |
| 200 | sector in the data area |
| 201 | * data area - it contains data sectors. The number of data sectors |
| 202 | in one run must be a power of two. log2 of this value is stored |
| 203 | in the superblock. |