Leetcode: Word Frequency

2015-05-10 21:50

Question

Write a bash script to calculate the frequency of each word in a text file words.txt.

For simplicity's sake, you may assume:

words.txt contains only lowercase characters and space ' ' characters.

Each word must consist of lowercase characters only.

Words are separated by one or more whitespace characters.

For example, assume that words.txt has the following content:

the day is sunny the the

the sunny is is

Your script should output the following, sorted by descending frequency:

the 4

is 3

sunny 2

day 1

Note:

Don’t worry about handling ties, it is guaranteed that each word’s frequency count is unique.

My Solution

An associative array is the usual way to count occurrences in awk. Here the output is piped to sort from inside awk:

awk '{for(i=1;i<=NF;i++){arr[$i]+=1}} END{for(i in arr){print i,arr[i] | "sort -nr -k2"}}' words.txt

Alternatively, pipe awk's output to sort in the shell:

awk '{for(i=1;i<=NF;i++){arr[$i]+=1}} END{for(i in arr){print i,arr[i]}}' words.txt | sort -nr -k2

sort -n: compare numerically; -r: reverse (descending) order; -k2: use the key starting at the second field, which here is the count
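To make the counting step easier to try out, here is a minimal sketch that wraps the awk approach in a function and runs it on the sample input from the question. The function name word_freq, the array name count, and the temporary file are assumptions made for illustration; they are not part of the original solution. The key is written as -k2,2nr so that only the second field is compared.

```shell
# Hypothetical helper: print "word count" for the file named in $1,
# sorted by descending count.
word_freq() {
  awk '{ for (i = 1; i <= NF; i++) count[$i]++ }      # tally every field
       END { for (w in count) print w, count[w] }' "$1" |
    sort -k2,2nr                                      # numeric, descending, field 2 only
}

# Recreate the sample input from the question in a temporary file.
sample=$(mktemp)
printf 'the day is sunny the the\nthe sunny is is\n' > "$sample"
word_freq "$sample"
rm -f "$sample"
```

Because awk splits each line on runs of whitespace by default, repeated spaces between words do not produce empty fields.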

Another Solution

cat words.txt | tr -s ' ' '\n' | sort | uniq -c | sort -rn | awk '{print $2" "$1}'

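tr -s ' ' '\n': replace every space with a newline, then squeeze runs of newlines into one, so each word ends up on its own line

uniq -c: collapse adjacent identical lines and prefix each with its count; this is why the input is sorted first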
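The stages of this pipeline can be sketched one per line, with a comment on what each contributes. The function name word_freq_tr and the temporary file are hypothetical, and the sketch reads the file with input redirection instead of cat, which is an equivalent swap rather than the original command.

```shell
# Hypothetical helper: same pipeline as above, reading the file named in $1.
word_freq_tr() {
  tr -s ' ' '\n' < "$1" |   # one word per line; -s squeezes runs of newlines
    sort |                  # bring identical words next to each other
    uniq -c |               # prefix each distinct word with its count
    sort -rn |              # highest count first
    awk '{ print $2, $1 }'  # swap the columns to "word count"
}

# Recreate the sample input from the question in a temporary file.
sample=$(mktemp)
printf 'the day is sunny the the\nthe sunny is is\n' > "$sample"
word_freq_tr "$sample"
rm -f "$sample"
```

One caveat: if the file begins with a space, tr emits a leading empty line, which is then counted as an empty word. The awk version does not have this problem, since it never sees empty fields.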