Giter Club home page Giter Club logo

mdlynch37 / text-analytics-with-python Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dipanjans/text-analytics-with-python

0.0 1.0 0.0 28.36 MB

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

License: Apache License 2.0

Python 100.00%

text-analytics-with-python's Introduction

Text Analytics with Python

A Practical Real-World Approach to Gaining Actionable Insights from your Data

Text analytics can be a bit overwhelming and frustrating at times with the unstructured and noisy nature of textual data and the vast amount of information available. "Text Analytics with Python" is a book packed with 385 pages of useful information based on techniques, algorithms, experiences and various lessons learnt over time in analyzing text data. This repository contains datasets and code used in this book. I will also be adding various notebooks and bonus content here from time to time. Keep watching this space!

Help Needed on porting code to Python 3.x. Please check this link -- To be resumed end of August-September, 2017.

TODO

  • Add code used in the book
  • Add datasets used in the book
  • Add book description
  • Update chapter descriptions
  • Add necessary code comments & documentation
  • Add code used in the book ported to Python 3.x (for people using Python 3)
  • Add bonus content

Get the book






About the book

Book Cover

Derive useful insights from your data using Python. Learn the techniques related to natural language processing and text analytics, and gain the skills to know which technique is best suited to solve a particular problem.

Text Analytics with Python teaches you both basic and advanced concepts, including text and language syntax, structure, semantics. You will focus on algorithms and techniques, such as text classification, clustering, topic modeling, and text summarization

A structured and comprehensive approach is followed in this book so that readers with little or no experience do not find themselves overwhelmed. You will start with the basics of natural language and Python and move on to advanced analytical and machine learning concepts. You will look at each technique and algorithm with both a bird's eye view to understand how it can be used as well as with a microscopic view to understand the mathematical concepts and to implement them to solve your own problems.

Edition: 1st   Pages: 385   Language: English
Book Title: Text Analytics with Python   Publisher: Apress (a part of Springer)   Copyright: Dipanjan Sarkar
Print ISBN: 978-1-4842-2387-1   Online ISBN: 978-1-4842-2388-8   DOI: 10.1007/978-1-4842-2388-8

This book:

  • Provides complete coverage of the major concepts and techniques of natural language processing (NLP) and text analytics
  • Includes practical real-world examples of techniques for implementation, such as building a text classification system to categorize news articles, analyzing app or game reviews using topic modeling and text summarization, and clustering popular movie synopses and analyzing the sentiment of movie reviews
  • Shows implementations based on Python and several popular open source libraries in NLP and text analytics, such as the natural language toolkit (nltk), gensim, scikit-learn, spaCy and pattern

Contents

  • Chapter 1: Natural Language Basics
  • Chapter 2: Python Refresher
  • Chapter 3: Processing and Understanding Text
  • Chapter 4: Text Classification
  • Chapter 5: Text Summarization
  • Chapter 6: Text Similarity and Clustering
  • Chapter 7: Semantic and Sentiment Analysis

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.