✏️
wiki
  • Introduction
  • Career
    • Data Science
      • Machine Learning
      • Natural Language Processing
      • Datasets
      • Computer Vision
      • Data Engineering
      • Web Scraping
      • Data Visualisation
    • Robotics
      • Localisation
        • Kalman Filtering
    • Web Development
      • React
  • Tech
    • Apps
      • Mac OS
        • Magnet
        • PDF Expert
        • PixelSnap
        • Digital Colour Meter
        • Daisy Disk
        • Alfred
        • Bartender
        • iStat Menus
      • Linux
        • Playerctl
      • Terminal Emulators
      • Text Editors
      • Shell
    • CLI
      • Git & Github
      • Monitoring
    • Desktop Customisation
  • Lifestyle
    • In progress... :)
  • Blog
    • Why I Migrated from Windows to Linux
Powered by GitBook
On this page
  • Introduction
  • Resources

Was this helpful?

  1. Career
  2. Data Science

Natural Language Processing

Giving computers the ability to interpret sentences and text as well as humans do.

PreviousMachine LearningNextDatasets

Last updated 4 years ago

Was this helpful?

Introduction

Natural Language Processing (NLP) is studying algorithms and methods of giving computers the ability to interpret sentences and text as well as humans do. This is similar to Computer Vision, except rather than images and videos, the goal is to understand text. Examples include sentiment analysis and topic modelling. Popular applications include chatbots or conversational agents.

Common techniques in NLP pre-processing include:

  • Tokenisation: Breaking up a paragraph into a list of words for future processing.

  • Stop words: Commonly used words such as "the", "a" and "for", which usually are removed during text pre-processing as they have no significant meaning in use-cases such as sentiment analysis.

  • Lemmatisation: Grouping different forms of the same word into one category. For example, organised, organises, and organising all refer to the base word organise, and have the same meaning.

Resources

  • Popular open source libraries for NLP include:

  • AWS technologies include:

    • : Find insights and relationships in text.

    • : Building conversational interfaces using voice and text.

  • Google Cloud technologies include:

    • is a Google tool aimed to quickly create conversational agents which can easily interface with websites such as Facebook, Skype.

  • Microsoft Azure technologies include:

spaCy
NLTK
Stanford CoreNLP
TextBlob
Amazon Comprehend
Amazon Lex
Natural Language API
Dialogflow
Text Analytics