hadoop-dfs-dir.txt
上传用户:quxuerui
上传日期:2018-01-08
资源大小:41811k
文件大小:3k
- #
- # This is a readme for hadoop-version-dfs-dir.tgz and hadoop-dfs-dir.txt.
- #
- # See HADOOP-1629 for more info if needed.
- # These two files are used by unit test TestDFSUpgradeFromImage.java
- #
- # hadoop-14-dfs-dir.tgz :
- # ---------------------
- # This file contains the HDFS directory structure for one namenode and 4 datanodes.
- # The structure is setup similar to the structure used in MiniDFSCluster.
- # The directory was created with Hadoo-0.14.x.
- #
- # In the test, this directory is unpacked and MiniDFSCluster is run with
- # "-upgrade" option. The test waits for the upgrade to complete
- # (leave safe mode) and then all the files are read. The test checks that the
- # directory structure and file checksums exactly match the information
- # in this file.
- #
- # hadoop-dfs-dir.txt :
- # ---------------------
- # Along with this description this file contains the expected files and
- # checksums or the files in the upgraded DFS.
- #
- # The original DFS directory was created with various types of files and with
- # some recoverable errors (i.e. corrupt or missing .crc files).
- #
- # A similar set of files exist in two different DFS directories.
- # For e.g. "top-dir-1Mb-512" contains files created with dfs.block.size of 1Mb
- # and io.bytes.per.checksum of 512.
- #
- # In the future, when Hadoop project no longer supports upgrade from
- # Hadoop-0.12, then a new DFS directory image must be created.
- #
- # To generate checksum info for new files :
- # ---------------------------------------
- # Uncomment the last coment (starts with "printChecksums") and run the
- # test again. When the test sees this line, it prints the checksum
- # information that should replace the checksum information in
- # this file. When run in this mode, the test will fail with a descriptive IOException.
- #
- # Next, extract the checksum info from the test log like this:
- # sed -n 's/.*CRC info for reference file : //p' test-log.txt >> this_file
- # This will append a new list of files and checksums to this file. Be sure to remove the existing checksum info.
- #
- # For your reference, the format of the checksum info below is "filename whitespace*twhitespace* checksumn"
- #
- # Uncomment the following line to produce checksum info for a new DFS image.
- #printChecksums
- /1kb-multiple-checksum-blocks-64-16 191893480
- /top-dir-120000-60/1Mb-file 4079112547
- /top-dir-120000-60/4k-file 3716287280
- /top-dir-120000-60/5Mb-file 2563834633
- /top-dir-120000-60/directory1/500thousand-file 3036538664
- /top-dir-120000-60/directory1/file-with-corrupt-crc 1984689737
- /top-dir-120000-60/directory1/file-with-no-crc 4004594475
- /top-dir-120000-60/directory1/zero1 0
- /top-dir-120000-60/zerolen 0
- /top-dir-1Mb-512/1Mb-file 4079112547
- /top-dir-1Mb-512/4k-file 3716287280
- /top-dir-1Mb-512/5Mb-file 2563834633
- /top-dir-1Mb-512/directory1/500thousand-file 3036538664
- /top-dir-1Mb-512/directory1/file-with-corrupt-crc 1984689737
- /top-dir-1Mb-512/directory1/file-with-no-crc 4004594475
- /top-dir-1Mb-512/directory1/zero1 0
- /top-dir-1Mb-512/zerolen 0
- overallCRC 1419480698