Validating import results in a cluster
View the results of an import in the Cassandra File System.
To list the files that the import created, use this command:
./dse hadoop fs -ls /npa_nxx
The output lists one or more files in the directory, named part-m-0000n, where n ranges from 0 to one less than the number of map tasks that ran as part of the Hadoop job. The number of files depends on the number of DataStax Enterprise analytics nodes and the task tracker configuration.
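Because the import is split across several part files, it can help to count them and compare the count with the number of map tasks you expected. The following is only a minimal sketch: count_part_files is a hypothetical helper, and DSE_CMD is an assumed variable for the launcher path that falls back to ./dse as used in this topic.

```shell
# Sketch only: count_part_files is a hypothetical helper, not part of DSE.
# DSE_CMD is an assumed variable for the launcher path; it defaults to ./dse.
count_part_files() {
  # "hadoop fs -ls" prints one line per directory entry;
  # count the lines that name a map output file (part-m-...).
  "${DSE_CMD:-./dse}" hadoop fs -ls "$1" | grep -c 'part-m-'
}

# Example call (needs a running DataStax Enterprise analytics node):
# count_part_files /npa_nxx
```

The helper prints only the count, so the result can be compared directly with the expected number of tasks.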
The contents of these files can be viewed using this command:
./dse hadoop fs -cat /npa_nxx/part-m-00000
To view a different file, change the task number (the 00000 suffix). The output looks something like this:
361991,361,991,27.73,097.40,L,TX,Corpus Christi
361992,361,992,27.73,097.40,L,TX,Corpus Christi
361993,361,993,27.73,097.40,L,TX,Corpus Christi
361994,361,994,27.73,097.40,L,TX,Corpus Christi
361998,361,998,27.79,097.90,L,TX,Agua Dulce
361999,361,999,27.80,097.40,W,TX,Padre Island National Seashore
As the output shows, the CSV format that Sqoop requires uses a bare comma as the delimiter, with no optional spaces around it.
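One way to check a local CSV file against this rule is to search for a space directly before or after a comma. The following is a minimal sketch rather than a DataStax or Sqoop tool: the function name check_csv_delimiters and the file name in the usage comment are hypothetical.

```shell
# Sketch only: check_csv_delimiters is a hypothetical helper.
# It reports lines where a space sits directly before or after a comma.
check_csv_delimiters() {
  # grep -n prints each offending line with its line number;
  # grep exits with status 0 only when at least one line matches.
  if grep -n -E ' ,|, ' "$1"; then
    echo "spaces found next to a delimiter in $1" >&2
    return 1
  fi
  return 0
}

# Example call with a hypothetical file name:
# check_csv_delimiters npa_nxx.csv && echo "delimiters look clean"
```

Spaces inside a field, such as the one in Corpus Christi, are not flagged, because the pattern only matches a space adjacent to a comma.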