
GBs and TBs of data are common, but not for this task. All you are doing is Word Sense Disambiguation (WSD), and there are WSD algorithms that work with much, much smaller training sets. I just don't think the exponential increase in training data is justified...
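For a concrete sense of how little data a classic WSD algorithm needs: the simplified Lesk method disambiguates a word by picking the sense whose dictionary gloss overlaps most with the surrounding sentence, no training corpus at all. A minimal sketch (the tiny "bank" sense inventory and glosses below are made-up illustrations, not from a real lexicon):

```python
def simplified_lesk(context_words, senses):
    """Pick the sense whose gloss shares the most words with the context.

    context_words: list of words around the ambiguous word.
    senses: dict mapping sense name -> gloss string.
    """
    ctx = {w.lower() for w in context_words}
    best, best_overlap = None, -1
    for sense, gloss in senses.items():
        overlap = len(ctx & set(gloss.lower().split()))
        if overlap > best_overlap:
            best, best_overlap = sense, overlap
    return best

# Hypothetical two-sense inventory for "bank".
bank_senses = {
    "bank/finance": "a financial institution that accepts deposits and loans money",
    "bank/river": "sloping land beside a body of water such as a river",
}

sentence = "the bank accepts deposits and loans money at low rates".split()
print(simplified_lesk(sentence, bank_senses))  # → bank/finance
```

Real systems use a full sense inventory like WordNet and smarter overlap measures, but the point stands: the problem is tractable without web-scale data.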

