In question 1:
Edit:
SELECT
corpus, corpus_date, word_count, SUM(word_count)
OVER
(PARTITION BY corpus, corpus_date
ORDER BY word_count, word DESC) AS running_sum
FROM [publicdata:samples.shakespeare]
To:
SELECT
corpus, corpus_date, word_count, SUM(word_count)
OVER
(PARTITION BY corpus, corpus_date
ORDER BY word_count DESC, word) AS running_sum
FROM [publicdata:samples.shakespeare]
(The original query is sorted by word, but you want to sort it by word_count)
source
share