Skip to content

Commit eded1bf

Browse files
committedJan 24, 2020
reading in de dataset
1 parent 7610d52 commit eded1bf

File tree

5 files changed

+11520
-6109
lines changed

5 files changed

+11520
-6109
lines changed
 

‎.ipynb_checkpoints/nltk_overview-checkpoint.ipynb

+66
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,71 @@
11
{
22
"cells": [
3+
{
4+
"cell_type": "code",
5+
"execution_count": 1,
6+
"metadata": {},
7+
"outputs": [
8+
{
9+
"name": "stdout",
10+
"output_type": "stream",
11+
"text": [
12+
"showing info https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/index.xml\n"
13+
]
14+
},
15+
{
16+
"data": {
17+
"text/plain": [
18+
"True"
19+
]
20+
},
21+
"execution_count": 1,
22+
"metadata": {},
23+
"output_type": "execute_result"
24+
}
25+
],
26+
"source": [
27+
"import nltk\n",
28+
"nltk.download()"
29+
]
30+
},
31+
{
32+
"cell_type": "code",
33+
"execution_count": 3,
34+
"metadata": {},
35+
"outputs": [],
36+
"source": [
37+
"# dir(nltk)"
38+
]
39+
},
40+
{
41+
"cell_type": "code",
42+
"execution_count": 6,
43+
"metadata": {},
44+
"outputs": [],
45+
"source": [
46+
"rawData = open('data/SMSSpamCollection.csv').read()"
47+
]
48+
},
49+
{
50+
"cell_type": "code",
51+
"execution_count": 8,
52+
"metadata": {},
53+
"outputs": [
54+
{
55+
"data": {
56+
"text/plain": [
57+
"'label,text\\n0.0,Go until jurong point, crazy.. Available only in bugis n great world la e buffet... Cine there got amore wat...\\n0.0,Ok lar... Joking wif u oni...\\n1.0,Free entry in 2 a wkly comp to win FA Cup final tkts 21st May 2005. Text FA to 87121 to receive entry questionstd txt rateTCs apply 08452810075over18s\\n0.0,U dun say so early hor... U c already then say...\\n0.0,Nah I dont think he goes to usf, he lives around here though\\n1.0,FreeMsg Hey there darling its been 3 weeks now and no word ba'"
58+
]
59+
},
60+
"execution_count": 8,
61+
"metadata": {},
62+
"output_type": "execute_result"
63+
}
64+
],
65+
"source": [
66+
"rawData[0:500]"
67+
]
68+
},
369
{
470
"cell_type": "code",
571
"execution_count": null,

0 commit comments

Comments
 (0)
Please sign in to comment.