This is the documentation webpage for the clscripts,
a collection of computational linguistics scripts
written in different languages (bash, python, octave, R, C, etc).
Table of Contents
wordcounttfl.sh
count the occurrence of words in a text file
entropy.py
estimate the Shannon’s entropy for a list of counts
heapslaw.py
extract vocabulary size from different lengths of a text file