Word Co-occurrence

mubashshir

March 23, 2012

Lets say we have a database table called responses, each row contains a word.
responses Table:

id	positive	response
1	true	I have a great experience. I was treated very well. The person was very nice
2	false	I had a terrible experience. I was not treated very well. I thought person was very mean.

We map give each word an id on one table. Lets call it the words table.
words Table:

id	positive	word	count
1	true	experience	1
2	false	experience	1
3	true	I	2
4	false	I	3

We go to each row, we get all the words, if the word does not exist in the words table we add the word. (A new id will be created associated with that word)
Then we get every combination of 2 words in that paragraph and add it to a occurrences table.
occurrences Table:

word1_id	word2_id	count
1	3	2
2	4	3

Question: Do we want to count the same word in the same sentence more than once in relationships? The word ‘I’ and ‘experience’ occur three times together in that second sentence?
Basically we than get all the true occurrences and rank them by count. Same with false occurrences and we can present them however we want.
Blogs / Pictures (of what I might want)
High Resolution Maps of Science
Papers

Algorithms extracting linguistic relations and their evaluation
Text Algorithms
Text Mining
Rapid Miner – open source data mining, java based, has filtering options.
Kind of Related But Very Interesting
Visual Thesaurus – We could do something similar to this but you also pick a minimum threshold and it shows all the word related that meet it.

January 10, 2012

Word Co-occurrence

Related Articles

Map of reddit land – very cool

Samples & Demos | Ext JS 4 | Products | Sencha

NCAA Mens Basketball Tournament Bracket, 2012 – The Power Rank