windowindex.py

Get star and end indexes of windows when subdividing the text extent.

script usage

The script help presents the available parameters that might be used.

$ ./windowindex.py -h
usage: windowindex.py [-h] [-i FILE] [--start START] [--stop STOP]
                      [--token {word,char,line}]
                      [--wtype {cumulative,sliding}] [--nwin NWIN]
                      [--wscale {linear,log,log10,log2}]

optional arguments:
  -h, --help            show this help message and exit
  -i FILE               Input text file.
  --start START         Start index (default = 0).
  --stop STOP           Stop index (default = file_length - 1).
  --token {word,char,line}
                        Choose a token structures.
  --wtype {cumulative,sliding}
                        Window type.
  --nwin NWIN           Number of windows
  --wscale {linear,log,log10,log2}
                        Window scale.

usage examples

Some usage examples are presented below.

$ ./windowindex.py -i alice.txt --token line --nwin 10 --wscale log --wtype sliding
     2
     5
     11
    26
    58
    130
   293
   659
   1484
  3340

$ ./windowindex.py -i alice.txt --token line --nwin 10 --wscale linear --wtype sliding
     334
   668
   1002
  1336
  1670
  2004
  2338
  2672
  3006
  3340

back