Bits of Language: corpus linguistics, NLP and text analytics
  • Corpus Linguistics
  • Tutorials
  • Text Complexity

Resources and links of interest

Archive of links gathered during my PhD thesis:

  1. Linguistics and NLP
  2. Corpus Linguistics
  3. Perl
  4. LaTeX
  5. R
  6. PhD related
  7. Misc.

Update: Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German

1 – Linguistics and NLP

General Linguistics

  • Glottopedia, the free encyclopedia of Linguistics (project)
  • Resource List of the Linguistic Society of America
  • The Linguist List
  • General Linguistics Internet Resources (Joaquim Llisterri, Universitat Autònoma de Barcelona)

Computational Linguistics

  • Natural Language Processing FAQ
  • comp.text Frequently Asked Questions (Usenet archive)
  • Language Technology World
  • Linguistics Computing Resources on the Internet
  • Pattern Matching Pointers
  • Natural Language Software Registry
  • Semantic …
more ...

About Adrien Barbaresi
I'm a research scientist at the
Berlin-Brandenburg Academy of Sciences

Welcome to my academic blog about web corpora, text mining, computational linguistics and digital humanities.

  • Social

    • Twitter
    • LinkedIn
    • GitHub
  • Tags

    • bibliography
    • code snippet
    • corpus linguistics
    • data mining
    • readability assessment
    • research
    • text cleaning
    • trafilatura
    • web corpus construction
    • web crawling
  • Links

    • Homepage
    • Scientific Publications
    • Web text collections (DWDS)
    • Center for Digital Lexicography of German (ZDL)

© 2010 Adrien Barbaresi · Powered by pelican-bootstrap3, Pelican, Bootstrap

Creative Commons License Content licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where indicated otherwise.

Back to top