Package: textpress 1.0.0
Jason Timm
textpress: A Lightweight and Versatile NLP Toolkit
A simple Natural Language Processing (NLP) toolkit focused on search-centric workflows with minimal dependencies. The package offers key features for web scraping, text processing, corpus search, and text embedding generation via the 'HuggingFace API' <https://huggingface.co/docs/api-inference/index>.
Authors:
textpress_1.0.0.tar.gz
textpress_1.0.0.zip(r-4.5)textpress_1.0.0.zip(r-4.4)textpress_1.0.0.zip(r-4.3)
textpress_1.0.0.tgz(r-4.4-any)textpress_1.0.0.tgz(r-4.3-any)
textpress_1.0.0.tar.gz(r-4.5-noble)textpress_1.0.0.tar.gz(r-4.4-noble)
textpress_1.0.0.tgz(r-4.4-emscripten)textpress_1.0.0.tgz(r-4.3-emscripten)
textpress.pdf |textpress.html✨
textpress/json (API)
# Install 'textpress' in R: |
install.packages('textpress', repos = c('https://jaytimm.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/jaytimm/textpress/issues
corpus-searchnlpopenai-embeddingsweb-scraping
Last updated 1 months agofrom:9c5e7a8d68. Checks:OK: 7. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 14 2024 |
R-4.5-win | OK | Nov 14 2024 |
R-4.5-linux | OK | Nov 14 2024 |
R-4.4-win | OK | Nov 14 2024 |
R-4.4-mac | OK | Nov 14 2024 |
R-4.3-win | OK | Nov 14 2024 |
R-4.3-mac | OK | Nov 14 2024 |
Exports:.decode_duckduckgo_urls.extract_links.get_site.process_bing.process_yahooabbreviationsapi_huggingface_embeddingsextract_datenlp_build_chunksnlp_cast_tokensnlp_melt_tokensnlp_split_paragraphsnlp_split_sentencesnlp_tokenize_textsem_nearest_neighborssem_search_corpusstandardize_dateweb_scrape_urlsweb_search
Dependencies:askpassclicpp11curldata.tablefansigenericsgluehttrjsonlitelatticelifecyclelubridatemagrittrMatrixmimeopensslpbapplypillarpkgconfigR6rlangrvestselectrstringistringrsystibbletimechangeutf8vctrsxml2