Bookmark
LibreTranslate: Free and Open Source Machine Translation API
https://github.com/uav4geo/LibreTranslate, posted 13 Feb by peter in api free language nlp opensource software
Free and Open Source Machine Translation API, entirely self-hosted. Unlike other APIs, it doesn't rely on proprietary providers such as Google or Azure to perform translations.
Bookmark
EleutherAI - GPT-Neo
https://www.eleuther.ai/gpt-neo, posted 18 Jan by peter in ai free nlp opensource
GPT-Neo is the code name for a series of transformer-based language models loosely styled around the GPT architecture that we plan to train and open source. Our primary goal is to replicate a GPT-3 sized model and open source it to the public, for free.
Bookmark
Apache Tika
https://tika.apache.org/, posted 10 Dec by peter in free language nlp opensource search software
The Apache Tika™ toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more.
Bookmark
Tone Analyzer
https://tone-analyzer-demo.mybluemix.net/, posted 2016 by peter in ai demo language nlp online text writing
This service uses linguistic analysis to detect and interpret emotions, social tendencies, and language style cues found in text.
Bookmark
wooorm/franc
https://github.com/wooorm/franc, posted 2014 by peter in development free language nlp opensource python software
Detect the language of text.
Bookmark
TextBlob: Simplified Text Processing — TextBlob 0.5.0 documentation
https://textblob.readthedocs.org/en/latest/, posted 2013 by peter in development free language nlp python software toread
TextBlob is a Python (2 and 3) library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, translation, and more.
Bookmark
An Efficient Way to Extract the Main Topics from a Sentence | The Tokenizer
thetokenizer.com/2013/05/09/efficient-way-to-extract-the-main-topics-of-a-sentence/, posted 2013 by peter in language nlp python toread
Last week, while working on new features for our product, I had to find a quick and efficient way to extract the main topics/objects from a sentence. Since I’m using Python, I initially thought that it’s going to be a very easy task to achieve with NLTK. However, when I tried its default tools (POS tagger, Parser…), I indeed got quite accurate results, but performance was pretty bad. So I had to find a better way. Like I did in my previous post, I’ll start with the bottom line – Here you can find my code for extracting the main topics/noun phrases from a given sentence. It works fine with real sentences (from a blog/news article). It’s a bit less accurate compared to the default NLTK tools, but it works much faster!
Bookmark
translate.google.com/toolkit, posted 2013 by peter in conversion free language nlp online
Google Translator Toolkit is a powerful and easy-to-use editor that helps translators work faster and better.
Bookmark
Delver - a natural language interface to your app
delver.io/, posted 2013 by peter in development language nlp software toread
Down in the depths of your organisation, you have a treasure-trove of valuable data. But how hard is it for your users to retrieve it? Salvage your data with a natural language interface - ask your app English questions, get clear answers and reports back.
Bookmark
High Scalability - High Scalability - DuckDuckGo Architecture - 1 Million Deep Searches a Day and Growing
highscalability.com/blog/2013/1/28/duckduckgo-architecture-1-million-deep-searches-a-day-and-gr.html, posted 2013 by peter in development nlp scalability search
This is an interview with Gabriel Weinberg, founder of Duck Duck Go and general all around startup guru, on what DDG’s architecture looks like in 2012.