How to notice unusual news activities

Suppose you were able to track news about different people, for example, “Steve Jobs” and “Steve Ballmer”.

In what ways could you determine if the number of references to an entity over a given period of time was unusual compared to their usual degree of frequency of occurrence?

I believe that for a more popular person like Steve Jobs, a 50% increase may be unusual (an increase from 1000 to 1500), while for a relatively unknown CEO, an increase of 1000% for a given day is possible (an increase of 2 up to 200). If you didn’t have a way to scale, your uniqueness index could dominate the unheard of, gaining them 15 minutes of fame.

update. To make it more understandable, it is assumed that you can already receive a continuous stream of news and identify entities in each news item and store all this in a relational data warehouse.

+5
source share
4 answers

You can use a moving average . This is how many trackers work. By tracking the last points of data n , you can see if this change was a significant change outside their usual variance.

- : (m), ( δ), ( z), z = m * δ. ( m0 - m):

Name                m    m0    δ    z
Steve Jobs       4950  4500    .10      495
Steve Ballmer     400   300    .33      132
Larry Ellison      50    10    4.0      400
Andy Nobody        50    40    .20      10

400% - az 400, 10% - 495, 20% 10. , , , , .

+3
  • . , .
  • ( ).
  • , , X .

, , X. X = 1

+2

-   , 24 , . .

  , ? , ? , ? Apple , , ?

, 50 , . , Google 24 . , .

0

. , , , - , , , , -.

, , , , ,

  • . , , , , ( 1930- ).

  • ( ), , - , , () , ( ). () , . , ; 30 .

  • (.. ). , , , , .. .

  • , , . ? ( ) , . , - 5% - , , . , . .

: . 15 000 , 2000 300 , . , , , ; . , stationary - , .

, , . , ( - , ). ... , , . , : (), ( ) - ( , -, ).

, . StackExchange, , .

0
source

All Articles