How to count the words in a string in Ruby

Question

I want to do something like this:

def get_count(string)
  string.split(' ').count
end

I think there may be a better way; perhaps String has a built-in method for this.

ruby

mko Jun 21 '11 at 11:07

9 answers

Candleide · Answer 1 · 2011-06-21T11:10:54+0000

I believe that count is a function, so you probably want to use length.

 def get_count(string)
   string.split(' ').length
 end

Edit: if your string is really long, creating an array from it with any splitting will require more memory, so here is a faster way:

 def get_count(string)
   # Start at 1 and add one for every space character found.
   (0..(string.length - 1)).inject(1) { |m, e| m + (string[e].chr == ' ' ? 1 : 0) }
 end
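
A minimal sketch of the same idea (counting without building an array of words), assuming words are separated by one or more whitespace characters. The method name count_words_streaming is hypothetical. It walks the string once and counts each transition from whitespace into a word, so runs of spaces and leading or trailing whitespace are not overcounted.

```ruby
# Hypothetical helper: counts words in a single pass without
# allocating an array that holds every word.
def count_words_streaming(string)
  count = 0
  in_word = false
  string.each_char do |char|
    if char =~ /\s/
      in_word = false
    elsif !in_word
      # First non-whitespace character after whitespace starts a new word.
      in_word = true
      count += 1
    end
  end
  count
end

puts count_words_streaming("this sentence  has five words") # prints 5
```

Whether this is actually faster than split depends on the Ruby version and the input, so it is worth benchmarking before relying on it.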

steenslag · Answer 2 · 2011-06-21T11:42:03+0000

If the only word boundary is a single space, just count the spaces.

 puts "this sentence has five words".count(' ')+1 # => 5

If the words are separated by spaces, line endings, tabs, commas followed by a space, and so on, then scanning for word boundaries is a possibility:

 puts "this, is.\tfour words".scan(/\b/).size/2 # => 4
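
A short sketch comparing the two approaches on input with repeated spaces. The method names are hypothetical. Each word contributes two \b matches (one at its start, one at its end), which is why the boundary count is halved.

```ruby
# Hypothetical helper: assumes exactly one space between words.
def count_by_spaces(text)
  text.count(' ') + 1
end

# Hypothetical helper: every word has a start and an end boundary.
def count_by_boundaries(text)
  text.scan(/\b/).size / 2
end

puts count_by_spaces("two   words")     # prints 4, every extra space is counted
puts count_by_boundaries("two   words") # prints 2
```

Note that \b treats an apostrophe as a boundary, so contractions are counted as two words by the second method.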

Mohamad · Answer 3 · 2014-10-22T16:51:14+0000

I know this is an old question, but this may help someone who stumbles across it. Word counting is a difficult problem. What is a word? Do numbers and special characters count as words? And so on.

I wrote words_counted for this purpose. It is a highly flexible, customizable string analyzer. You can ask it to analyze any string for its word count and word occurrences, and to exclude words or characters using regular expressions, strings, and arrays.

 counter = WordsCounted::Counter.new("Hello World!", exclude: "World")
 counter.word_count #=> 1
 counter.words      #=> ["Hello"]

Etc ...

The documentation and full source are on GitHub.

Samuel müller · Answer 4 · 2011-06-21T11:13:51+0000

Using a regex will also handle several consecutive spaces:

 sentence.split(/\s+/).size
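
A small sketch of an edge case when splitting on a whitespace pattern such as /\s+/: split keeps a leading empty field but drops trailing ones, so leading whitespace inflates the count. Calling strip first, or calling split with no argument, avoids this. The sample string is made up for illustration.

```ruby
padded = "  two  words  "

puts padded.split(/\s+/).size       # prints 3, the leading "" is counted
puts padded.strip.split(/\s+/).size # prints 2
puts padded.split.size              # prints 2, split with no argument ignores leading whitespace
```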

Syed aslam · Answer 5 · 2011-06-21T11:15:16+0000

String has no built-in method that does what you want. You can define a method in your own class, or extend the String class itself:

 def word_count(string)
   return 0 if string.empty?
   string.split.size
 end
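
A minimal sketch of the second option, extending String itself. Reopening the class is only one way to do this (a refinement would limit the scope of the change), and word_count is a hypothetical method name, not part of core Ruby.

```ruby
# Assumption: reopening String globally is acceptable in this codebase.
class String
  # Hypothetical method: number of whitespace-separated words.
  def word_count
    split.size
  end
end

puts "this sentence has five words".word_count # prints 5
puts "".word_count                              # prints 0
```

Since "".split already returns an empty array, this version does not need a separate check for the empty string.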

Pavling · Answer 6 · 2011-06-21T11:38:06+0000

A regex that splits on any non-word character:

 string.split(/\W+/).size

... although this means a word with an apostrophe counts as two words, so depending on how small your margin of error needs to be, you may want to write your own regular expression.
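
One possible custom pattern for the apostrophe case, as a sketch only. It assumes an apostrophe counts as part of a word only when it sits between word characters, and the method name is hypothetical.

```ruby
# Hypothetical helper: a word is a run of word characters, optionally
# followed by apostrophe-joined runs (as in "don't" or "o'clock").
def count_words_with_apostrophes(text)
  text.scan(/\w+(?:'\w+)*/).size
end

puts "don't stop".split(/\W+/).size             # prints 3: "don", "t", "stop"
puts count_words_with_apostrophes("don't stop") # prints 2
```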

Andrew Grimm · Answer 7 · 2011-06-22T23:34:22+0000

I recently discovered that String#count is faster than splitting the string by more than an order of magnitude.

Unfortunately, String#count only accepts strings (sets of characters), not a regular expression. In addition, it counts two adjacent spaces as two things rather than one, and you have to handle the other whitespace characters separately.
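
A sketch of one way around both limitations, using only core String methods. String#count accepts a set of characters, and tr_s can collapse runs of whitespace into a single space first. The method name is hypothetical, and because strip and tr_s each build a new string, the speed advantage mentioned above may not carry over; that is an assumption to benchmark.

```ruby
# Hypothetical helper: counts words with String#count instead of split.
def count_words_with_count(text)
  stripped = text.strip
  return 0 if stripped.empty?
  # tr_s turns every listed whitespace character into a space and
  # squeezes each run of them into a single space.
  stripped.tr_s(" \t\n\r", " ").count(" ") + 1
end

puts "a b\tc".count(" \t")                            # prints 2
puts count_words_with_count("one  two\tthree\n four") # prints 4
```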

user495470 · Answer 8 · 2011-10-14T22:37:40+0000

 p " some word\nother\tword.word|word".strip.split(/\s+/).size #=> 4

leviathan · Answer 9 · 2012-04-20T12:23:07+0000

I would rather match the words themselves:

 "Lorem Lorem Lorem".scan(/\w+/).size # => 3

If you need to match rock-and-roll as one word, you can do the following:

 "Lorem Lorem Lorem rock-and-roll".scan(/[\w-]+/).size # => 4
