Package: textpress 1.0.0

Jason Timm

textpress: A Lightweight and Versatile NLP Toolkit

A simple Natural Language Processing (NLP) toolkit focused on search-centric workflows with minimal dependencies. The package offers key features for web scraping, text processing, corpus search, and text embedding generation via the 'HuggingFace API' <https://huggingface.co/docs/api-inference/index>.

Authors:Jason Timm [aut, cre]

textpress_1.0.0.tar.gz
textpress_1.0.0.zip(r-4.5)textpress_1.0.0.zip(r-4.4)textpress_1.0.0.zip(r-4.3)
textpress_1.0.0.tgz(r-4.4-any)textpress_1.0.0.tgz(r-4.3-any)
textpress_1.0.0.tar.gz(r-4.5-noble)textpress_1.0.0.tar.gz(r-4.4-noble)
textpress_1.0.0.tgz(r-4.4-emscripten)textpress_1.0.0.tgz(r-4.3-emscripten)
textpress.pdf |textpress.html
textpress/json (API)

# Install 'textpress' in R:
install.packages('textpress', repos = c('https://jaytimm.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/jaytimm/textpress/issues

On CRAN:

corpus-searchnlpopenai-embeddingsweb-scraping

4.45 score 3 stars 279 downloads 19 exports 32 dependencies

Last updated 1 months agofrom:9c5e7a8d68. Checks:OK: 7. Indexed: yes.

TargetResultDate
Doc / VignettesOKNov 14 2024
R-4.5-winOKNov 14 2024
R-4.5-linuxOKNov 14 2024
R-4.4-winOKNov 14 2024
R-4.4-macOKNov 14 2024
R-4.3-winOKNov 14 2024
R-4.3-macOKNov 14 2024

Exports:.decode_duckduckgo_urls.extract_links.get_site.process_bing.process_yahooabbreviationsapi_huggingface_embeddingsextract_datenlp_build_chunksnlp_cast_tokensnlp_melt_tokensnlp_split_paragraphsnlp_split_sentencesnlp_tokenize_textsem_nearest_neighborssem_search_corpusstandardize_dateweb_scrape_urlsweb_search

Dependencies:askpassclicpp11curldata.tablefansigenericsgluehttrjsonlitelatticelifecyclelubridatemagrittrMatrixmimeopensslpbapplypillarpkgconfigR6rlangrvestselectrstringistringrsystibbletimechangeutf8vctrsxml2