spaCy, Natural Language Processing library

Notice

« 2025/04 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

Tags more

Archives

Today

Total

관리 메뉴

Shakerato

spaCy, Natural Language Processing library 본문

Research

spaCy, Natural Language Processing library

Shakeratto 2018. 7. 1. 20:48

spaCy (https://spacy.io/):

Industrial-Strength Natural Language Processing Python Library

It's commercial open-source software, released under the MIT license.

spaCy github: https://github.com/explosion/spaCy

- Features

Non-destructive tokenization

Named entity recognition

Support for 28+ languages

13 statistical models for 8 languages

Pre-trained word vectors

Easy deep learning integration

Part-of-speech tagging

Labelled dependency parsing

Syntax-driven sentence segmentation

Built in visualizers for syntax and NER

Convenient string-to-hash mapping

Export to numpy data arrays

Efficient binary serialization

Easy model packaging and deployment

State-of-the-art speed

Robust, rigorously evaluated accuracy

- How to use it?

1. Install spacy in 'cmd' as administrater

pip install spacy

2. Download language model ('en' : 37.4M)

python -m spacy download en

* You can choose other models here: https://spacy.io/models/en

* If you want to delete the model,

Go to '\site-packages\spacy\data',

(If you use Anaconda,
C:\ProgramData\Anaconda3\envs\<envname>\Lib\site-packages\spacy\data)

delete the folder name 'en'

3. Try it (using python)

import spacy

nlp = spacy.load('en')

doc = nlp(u'This is a sentence.')

print(doc.text)

for token in doc:

print(token.text, token.pos_, token.dep_)

4. Try TorchTextTutorial (using spaCy)

https://github.com/mjc92/TorchTextTutorial

저작자표시 비영리

'Research' 카테고리의 다른 글

유튜브 다크모드(테마) 변경 (0)	2018.07.24
Install face_recognition and dlib on Windows 10 (0)	2018.07.15
docker 기본 명령어 (0)	2018.07.01
Text to speech(tts) in linux (with python code) (0)	2018.06.30
Run code the DL based object detection on raspberry pi 2 (0)	2018.06.30

'Research' Related Articles

Comments

Shakerato

spaCy, Natural Language Processing library 본문

spaCy, Natural Language Processing library

'Research' 카테고리의 다른 글

티스토리툴바