python之单词统计(words count)
2014-03-19 18:46
459 查看
wdCount.py内容如下:
#!/usr/bin/python
import sys
def print_words(filename):
fp = open(filename, "r")
text = fp.read()
print "text content is :\n" + text
fp.close()
split_text = text.split()
out_list =[]
out_dict ={}
for it in split_text:
if it in out_list:
out_dict[it] = 1 + out_dict[it]
else:
out_list.append(it)
out_dict[it] = 1
print '\nprint :'
#return out_dict
print out_dict
def main():
try:
if len(sys.argv) != 3:
print 'usage: ./wordcount.py {--count | --topcount} file'
sys.exit (1)
except SystemExit:
print "sys.argv is to less"
option = sys.argv[1]
filename = sys.argv[2]
if option== '--count':
print_words (filename)
elif option == '--topcount':
print_top(filename)
else:
print 'unknown option: ' + option
try:
sys.exit(1)
except SystemExit:
print "unknown option"
if __name__== '__main__':
main()
测试文件word.txt内容如下:
song song ok ok name is OK
执行过程及结果如下:
text@redhat#./wdCount.py --count ./word.txt
text content is :
song song ok ok name is OK
print :
{'OK': 1, 'is': 1, 'ok': 2, 'name': 1, 'song': 2}
#!/usr/bin/python
import sys
def print_words(filename):
fp = open(filename, "r")
text = fp.read()
print "text content is :\n" + text
fp.close()
split_text = text.split()
out_list =[]
out_dict ={}
for it in split_text:
if it in out_list:
out_dict[it] = 1 + out_dict[it]
else:
out_list.append(it)
out_dict[it] = 1
print '\nprint :'
#return out_dict
print out_dict
def main():
try:
if len(sys.argv) != 3:
print 'usage: ./wordcount.py {--count | --topcount} file'
sys.exit (1)
except SystemExit:
print "sys.argv is to less"
option = sys.argv[1]
filename = sys.argv[2]
if option== '--count':
print_words (filename)
elif option == '--topcount':
print_top(filename)
else:
print 'unknown option: ' + option
try:
sys.exit(1)
except SystemExit:
print "unknown option"
if __name__== '__main__':
main()
测试文件word.txt内容如下:
song song ok ok name is OK
执行过程及结果如下:
text@redhat#./wdCount.py --count ./word.txt
text content is :
song song ok ok name is OK
print :
{'OK': 1, 'is': 1, 'ok': 2, 'name': 1, 'song': 2}
相关文章推荐
- WordsCount(词频统计)-Python语言编写!
- python第三节课堂练习:统计一句单词
- 运行Hadoop自带的wordcount单词统计程序
- 利用python内置函数,快速统计单词在文本中出现的次数
- python字典用法-统计统计一句单词
- python 统计单词出现次数
- Python实现统计文本当中单词的数量,
- hadoop实例分析之WordCount单词统计分析
- Hadoop Demo 1 ——WordCount 统计文章中单词的个数
- 布同 统计英文单词的个数的python代码
- Python 基础 - 统计文本里单词的个数以及出现的次数
- python用字典来统计单词或汉字词的个数
- Python(2.7.x)实现简单的单词频数统计
- 启动Spark Shell,在Spark Shell中编写WordCount程序,在IDEA中编写WordCount的Maven程序,spark-submit使用spark的jar来做单词统计
- python统计文本字符串里单词出现频率的方法
- Python实现统计单词出现的个数
- [python每日一练]--0006:单词统计
- 用array_count_values统计一篇英文文档中每个单词的出现次数,结果用表格展示出来
- hadoop的统计单词程序WordCount
- 用python统计多个文本中你想统计的单词