Ruby 배열에서 동일한 문자열 요소를 계산하는 방법

Question 1

나는 다음이있다 Array = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]

동일한 각 요소에 대한 개수를 어떻게 생성 합니까?

Where:
"Jason" = 2, "Judah" = 3, "Allison" = 1, "Teresa" = 1, "Michelle" = 1?

또는 해시를 생성합니다 .

위치 : hash = { "Jason"=> 2, "Judah"=> 3, "Allison"=> 1, "Teresa"=> 1, "Michelle"=> 1}

Question 2

names = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]
counts = Hash.new(0)
names.each { |name| counts[name] += 1 }
# => {"Jason" => 2, "Teresa" => 1, ....

Question 3

names.inject(Hash.new(0)) { |total, e| total[e] += 1 ;total}

당신에게 준다

{"Jason"=>2, "Teresa"=>1, "Judah"=>3, "Michelle"=>1, "Allison"=>1}

Question 4

Ruby v2.7 이상 (최신)

(2019 12월 발표) 루비 v2.7.0로, 핵심 언어는 이제 포함 Enumerable#tally- 새로운 방법 이 문제를 위해 특별히 설계된를 :

names = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]

names.tally
#=> {"Jason"=>2, "Teresa"=>1, "Judah"=>3, "Michelle"=>1, "Allison"=>1}

Ruby v2.4 이상 (현재 지원되지만 이전 버전)

이 질문이 처음 질문되었을 때 (2011 년 2 월) 표준 루비에서는 다음 코드를 사용할 수 없었습니다.

Object#itself, Ruby v2.2.0 (2014 년 12 월 출시)에 추가되었습니다.
Hash#transform_values, 이는 Ruby v2.4.0 (2016 년 12 월 출시)에 추가되었습니다.

Ruby에 대한 이러한 최신 추가 기능을 통해 다음 구현이 가능합니다.

names = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]

names.group_by(&:itself).transform_values(&:count)
#=> {"Jason"=>2, "Teresa"=>1, "Judah"=>3, "Michelle"=>1, "Allison"=>1}

Ruby v2.2 이상 (지원 중단됨)

위에서 언급 한 Hash#transform_values방법에 액세스하지 않고 이전 루비 버전을 사용하는 경우 대신 Array#to_hRuby v2.1.0 (2013 년 12 월 릴리스)에 추가 된을 사용할 수 있습니다 .

names.group_by(&:itself).map { |k,v| [k, v.length] }.to_h
#=> {"Jason"=>2, "Teresa"=>1, "Judah"=>3, "Michelle"=>1, "Allison"=>1}

이전 루비 버전 ( <= 2.1)의 경우이를 해결할 수있는 여러 가지 방법이 있지만 (제 생각에는) 명확한 "최상의"방법은 없습니다. 이 게시물에 대한 다른 답변을 참조하십시오.

Question 5

이제 Ruby 2.2.0을 사용하여 itself방법을 활용할 수 있습니다 .

names = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]
counts = {}
names.group_by(&:itself).each { |k,v| counts[k] = v.length }
# counts > {"Jason"=>2, "Teresa"=>1, "Judah"=>3, "Michelle"=>1, "Allison"=>1}

Question 6

실제로이를 수행하는 데이터 구조가 MultiSet있습니다..

불행히도 MultiSetRuby 코어 라이브러리 또는 표준 라이브러리 에는 구현 이 없지만 웹 주위에 떠 다니는 두 가지 구현이 있습니다.

이것은 데이터 구조의 선택이 알고리즘을 단순화 할 수있는 방법을 보여주는 좋은 예입니다. 실제로이 특정 예에서는 알고리즘이 완전히 사라집니다. 말 그대로 다음과 같습니다.

Multiset.new(*names)

그리고 그게 다야. https://GitHub.Com/Josh/Multimap/ 사용 예 :

require 'multiset'

names = %w[Jason Jason Teresa Judah Michelle Judah Judah Allison]

histogram = Multiset.new(*names)
# => #<Multiset: {"Jason", "Jason", "Teresa", "Judah", "Judah", "Judah", "Michelle", "Allison"}>

histogram.multiplicity('Judah')
# => 3

http://maraigue.hhiro.net/multiset/index-en.php 사용 예 :

require 'multiset'

names = %w[Jason Jason Teresa Judah Michelle Judah Judah Allison]

histogram = Multiset[*names]
# => #<Multiset:#2 'Jason', #1 'Teresa', #3 'Judah', #1 'Michelle', #1 'Allison'>

Question 7

Enumberable#each_with_object 최종 해시를 반환하지 않아도됩니다.

names.each_with_object(Hash.new(0)) { |name, hash| hash[name] += 1 }

보고:

=> {"Jason"=>2, "Teresa"=>1, "Judah"=>3, "Michelle"=>1, "Allison"=>1}

Question 8

Ruby 2.7 이상

Enumerable#tally이 정확한 목적을 위해 Ruby 2.7이 도입 되었습니다. 여기에 좋은 요약이 있습니다 .

이 사용 사례에서 :

array.tally
# => { "Jason" => 2, "Judah" => 3, "Allison" => 1, "Teresa" => 1, "Michelle" => 1 }

출시되는 기능에 대한 문서는 여기에 있습니다 .

이것이 누군가를 돕기를 바랍니다!

Question 9

작동합니다.

arr = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]
result = {}
arr.uniq.each{|element| result[element] = arr.count(element)}

Question 10

다음은 약간 더 기능적인 프로그래밍 스타일입니다.

array_with_lower_case_a = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]
hash_grouped_by_name = array_with_lower_case_a.group_by {|name| name}
hash_grouped_by_name.map{|name, names| [name, names.length]}
=> [["Jason", 2], ["Teresa", 1], ["Judah", 3], ["Michelle", 1], ["Allison", 1]]

의 한 가지 장점은 group_by동일한 항목을 그룹화하는 데 사용할 수 있다는 것입니다.

another_array_with_lower_case_a = ["Jason", "jason", "Teresa", "Judah", "Michelle", "Judah Ben-Hur", "JUDAH", "Allison"]
hash_grouped_by_first_name = another_array_with_lower_case_a.group_by {|name| name.split(" ").first.capitalize}
hash_grouped_by_first_name.map{|first_name, names| [first_name, names.length]}
=> [["Jason", 2], ["Teresa", 1], ["Judah", 3], ["Michelle", 1], ["Allison", 1]]

Question 11

a = [1, 2, 3, 2, 5, 6, 7, 5, 5]
a.each_with_object(Hash.new(0)) { |o, h| h[o] += 1 }

# => {1=>1, 2=>2, 3=>1, 5=>3, 6=>1, 7=>1}

신용 Frank Wambutt

Question 12

names = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]
Hash[names.group_by{|i| i }.map{|k,v| [k,v.size]}]
# => {"Jason"=>2, "Teresa"=>1, "Judah"=>3, "Michelle"=>1, "Allison"=>1}

Question 13

여기에 많은 훌륭한 구현이 있습니다.

그러나 초보자로서 나는 이것이 읽고 구현하기 가장 쉬운 것이라고 생각할 것입니다.

names = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]

name_frequency_hash = {}

names.each do |name|
  count = names.count(name)
  name_frequency_hash[name] = count  
end
#=> {"Jason"=>2, "Teresa"=>1, "Judah"=>3, "Michelle"=>1, "Allison"=>1}

우리가 취한 조치 :

우리는 해시를 만들었습니다.
우리는 names배열 을 반복했습니다.
names배열 에 각 이름이 몇 번이나 나타 났는지
우리는을 사용하여 키를 생성 name하고 값을 사용하여count

약간 더 장황 할 수 있지만 (성능면에서는 키를 재정 의하여 불필요한 작업을 수행하게 될 것입니다), 제 생각에는 달성하려는 것을 읽고 이해하기가 더 쉽습니다.

Question 14

이것은 대답 이라기보다는 코멘트에 가깝지만, 코멘트는 그것을 정의하지 않습니다. 이렇게하면 Array = foo적어도 하나의 IRB 구현이 중단됩니다.

C:\Documents and Settings\a.grimm>irb
irb(main):001:0> Array = nil
(irb):1: warning: already initialized constant Array
=> nil
C:/Ruby19/lib/ruby/site_ruby/1.9.1/rbreadline.rb:3177:in `rl_redisplay': undefined method `new' for nil:NilClass (NoMethodError)
        from C:/Ruby19/lib/ruby/site_ruby/1.9.1/rbreadline.rb:3873:in `readline_internal_setup'
        from C:/Ruby19/lib/ruby/site_ruby/1.9.1/rbreadline.rb:4704:in `readline_internal'
        from C:/Ruby19/lib/ruby/site_ruby/1.9.1/rbreadline.rb:4727:in `readline'
        from C:/Ruby19/lib/ruby/site_ruby/1.9.1/readline.rb:40:in `readline'
        from C:/Ruby19/lib/ruby/1.9.1/irb/input-method.rb:115:in `gets'
        from C:/Ruby19/lib/ruby/1.9.1/irb.rb:139:in `block (2 levels) in eval_input'
        from C:/Ruby19/lib/ruby/1.9.1/irb.rb:271:in `signal_status'
        from C:/Ruby19/lib/ruby/1.9.1/irb.rb:138:in `block in eval_input'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:189:in `call'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:189:in `buf_input'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:103:in `getc'
        from C:/Ruby19/lib/ruby/1.9.1/irb/slex.rb:205:in `match_io'
        from C:/Ruby19/lib/ruby/1.9.1/irb/slex.rb:75:in `match'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:287:in `token'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:263:in `lex'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:234:in `block (2 levels) in each_top_level_statement'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:230:in `loop'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:230:in `block in each_top_level_statement'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:229:in `catch'
        from C:/Ruby19/lib/ruby/1.9.1/irb/ruby-lex.rb:229:in `each_top_level_statement'
        from C:/Ruby19/lib/ruby/1.9.1/irb.rb:153:in `eval_input'
        from C:/Ruby19/lib/ruby/1.9.1/irb.rb:70:in `block in start'
        from C:/Ruby19/lib/ruby/1.9.1/irb.rb:69:in `catch'
        from C:/Ruby19/lib/ruby/1.9.1/irb.rb:69:in `start'
        from C:/Ruby19/bin/irb:12:in `<main>'

C:\Documents and Settings\a.grimm>

Array클래스 이기 때문 입니다.

Question 15

arr = ["Jason", "Jason", "Teresa", "Judah", "Michelle", "Judah", "Judah", "Allison"]

arr.uniq.inject({}) {|a, e| a.merge({e => arr.count(e)})}

경과 시간 0.028 밀리 초

흥미롭게도 stupidgeek의 구현은 다음과 같이 벤치마킹되었습니다.

경과 시간 0.041 밀리 초

그리고이기는 대답 :

경과 시간 0.011 밀리 초

:)