[关闭]
@Arslan6and6 2016-09-03T13:26:16.000000Z 字数 825 阅读 932

Hive学习之WordCount单词统计

第七章、大数据仓库Hive深入


  1. [beifeng@hadoop-senior hadoop-2.5.0-cdh5.3.6]$ more sort.txt
  2. hadoop java
  3. mapreduce map
  4. reduce map
  5. yarn history
  6. hadoop yarn
  7. yarn java
  8. [beifeng@hadoop-senior hadoop-2.5.0-cdh5.3.6]$ pwd
  9. /opt/modules/hadoop-2.5.0-cdh5.3.6
  10. hive (test)> create table sort (fieldname string);
  11. load data local inpath '/opt/modules/hadoop-2.5.0-cdh5.3.6/sort.txt' into table sort;
  12. hive (test)> select * from sort;
  13. OK
  14. sort.fieldname
  15. hadoop java
  16. mapreduce map
  17. reduce map
  18. yarn history
  19. hadoop yarn
  20. yarn java
  21. hive (test)> create table words (word string);
  22. //将按行分割后的数据加载入 表words
  23. hive (test)> insert overwrite table words select explode(split(fieldname,'[\t]')) word from sort;
  24. hive (test)> select * from words;
  25. OK
  26. words.word
  27. hadoop
  28. java
  29. mapreduce
  30. map
  31. reduce
  32. map
  33. yarn
  34. history
  35. hadoop
  36. yarn
  37. yarn
  38. java
  39. hive (test)> select word,count(word) count from words group by word;
  40. word count
  41. hadoop 2
  42. history 1
  43. java 1
  44. java 1
  45. map 2
  46. mapreduce 1
  47. reduce 1
  48. yarn 3
添加新批注
在作者公开此批注前,只有你和作者可见。
回复批注