Commit Graph

5 Commits (36f3c4740d8c8ab826fa087db1ba6b1113937d3e)

Author SHA1 Message Date
Xiao Tianci 0659473535 add TruncateSequencePair, ToNumber C++ API and enable three test cases
4 years ago
xulei2020 18b519ae0f add sentence piece
5 years ago
qianlong cae77c0c22 BasicTokenizer not case fold on preserverd words
5 years ago
qianlong 4f16f036be Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp
5 years ago
qianlong 451c20a6f5 Add UnicodeCharTokenizer for nlp
5 years ago