.../data/stopwords/stemPOS.dat

stemPOS.dat

The languages listed below use the stemPOS.dat file to keep stemming from interfering with certain parts of speech (POS).

  • Slovak
  • Slovenian
  • Hungarian
  • Czech
  • Croatian

Without this file, words belonging to certain parts of speech, such as numbers, would be stemmed incorrectly, reducing accuracy in these languages.
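
The idea of exempting certain parts of speech from stemming can be sketched as follows. This is only an illustration, not the product's implementation: the format of stemPOS.dat is not described here, so the protected tags are hard-coded rather than read from the file, and the names toy_stem, stem_tokens, PROTECTED_POS, and the tag labels NOUN and NUM are all hypothetical.

```python
from typing import Callable, Iterable, List, Set, Tuple


def toy_stem(word: str) -> str:
    """Hypothetical stand-in stemmer: strips one of a few suffixes."""
    for suffix in ("ami", "ov", "y", "a"):
        # Only strip when at least three characters would remain.
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def stem_tokens(
    tagged: Iterable[Tuple[str, str]],
    protected_pos: Set[str],
    stem: Callable[[str], str] = toy_stem,
) -> List[str]:
    """Stem each (word, pos) pair unless its POS tag is protected."""
    return [word if pos in protected_pos else stem(word) for word, pos in tagged]


# Assumption: numbers are protected, as in the example given in the text.
PROTECTED_POS = {"NUM"}

# Illustrative tokens with made-up tags.
tagged = [("domami", "NOUN"), ("1984", "NUM"), ("dvaja", "NUM")]

with_protection = stem_tokens(tagged, PROTECTED_POS)
without_protection = stem_tokens(tagged, set())

print(with_protection)     # ['dom', '1984', 'dvaja']
print(without_protection)  # ['dom', '1984', 'dvaj']
```

With the protected set in place, the numeral "dvaja" passes through unchanged. Without it, the toy stemmer truncates the word to "dvaj", which is the kind of incorrect stemming the file is meant to prevent.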