Does anyone here know a tool where you can drop in PDFs etc., have it run things like "named entity recognition", and network the extracted concepts?
I used an open-source tool years ago that came from an investigative journalism background, but I can't seem to find it anymore (and I'm blanking on the name).
What's your go-to #python or #rstats tool(chain) for splitting #German compounds? I've tried a few but was not really satisfied. Maybe I missed something. #NLP #linguistics
@sascha_wolfer Have you looked into Holmes? It’s built on top of #spacy and I remember it being able to extract tokens from compound words: https://github.com/richardpaulhudson/holmes-extractor