11TEXT: Numerous tools for text processing
22========================================
33
4+ <img align =right src =" https://travis-ci.org/mit-nlp/Text.jl.svg?branch=master " alt =" Build Status " />
5+
46This package is a julia implementation of:
57
6- 1 . Text classification based on BoW models
8+ 1 . Text classification based on BoW models (e.g. topic/langauge id)
792 . Language ID (training and processing) based on word and character n-grams
8103 . Lewis's SMART stop list for English
9114 . tfidf/tfllr text feature normalization
@@ -12,10 +14,25 @@ This package is a julia implementation of:
1214Prerequistes
1315------------
1416
15- ` Stage ` - Needed for logging and memoization
16- ` ollam ` - online learning modules
17- ` Devectorize ` - macro-based devectorization
18- ` DataStructures ` - for DefaultDict
17+ - ` Stage ` - Needed for logging and memoization * (Note: requires manual install)*
18+ - ` Ollam ` - online learning modules * (Note: requires manual install)*
19+ - ` Devectorize ` - macro-based devectorization
20+ - ` DataStructures ` - for DefaultDict
21+ - ` Devectorize `
22+ - ` GZip `
23+ - ` Iterators ` - for iterator helper functions
24+
25+ Install
26+ -------
27+
28+ This is an experimental package which is not currently registered in
29+ the julia central repository. You can install via:
30+
31+ ``` julia
32+ Pkg. clone (" https://github.com/saltpork/Stage.jl" )
33+ Pkg. clone (" https://github.com/mit-nlp/Ollam.jl" )
34+ Pkg. clone (" https://github.com/mit-nlp/Text.jl" )
35+ ```
1936
2037License
2138-------
0 commit comments