{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# HW2: VECTORIZATION (Pandas style!)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## STEP 1: Import ALL the things\n", "### Import libraries " ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [], "source": [ "##########################################\n", "# NOTE: I'm toying with the idea of requiring the library just above \n", "# when I use it so it makes more sense in context\n", "##########################################\n", "# import os\n", "# import pandas as pd\n", "# from nltk.tokenize import word_tokenize, sent_tokenize\n", "# from nltk.sentiment import SentimentAnalyzer\n", "# from nltk.sentiment.util import *\n", "# from nltk.probability import FreqDist\n", "# from nltk.sentiment.vader import SentimentIntensityAnalyzer\n", "# sid = SentimentIntensityAnalyzer()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Import data from files" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [], "source": [ "import os\n", "def get_data_from_files(path):\n", " directory = os.listdir(path)\n", " results = []\n", " for file in directory:\n", " f=open(path+file)\n", " results.append(f.read())\n", " f.close()\n", " return results\n", "\n", "neg = get_data_from_files('../NEG_JK/')\n", "pos = get_data_from_files('../POS_JK/')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## STEP 2: Prep Data\n", "### STEP 2a: Turn that fresh text into a pandas DF" ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [], "source": [ "import pandas as pd\n", "neg_df = pd.DataFrame(neg)\n", "pos_df = pd.DataFrame(pos)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### STEP 2b: Label it" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [], "source": [ "pos_df['PoN'] = 'P'\n", "neg_df['PoN'] = 'N'" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### STEP 2c: Combine the dfs" ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [], "source": [ "all_df = neg_df.append(pos_df)" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", " | 0 | \n", "PoN | \n", "
---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "
3 | \n", "What idiotic FIlm\\nI can say that Phoenix is ... | \n", "N | \n", "
4 | \n", "Terrible\\nThe only thing good about this movi... | \n", "N | \n", "
... | \n", "... | \n", "... | \n", "
118 | \n", "Nerve-wracking, but in very uncomfortable way... | \n", "P | \n", "
119 | \n", "Solid film but there are glaring problems\\nOk... | \n", "P | \n", "
120 | \n", "Joker > Endgame\\nNeed I say more? Everything ... | \n", "P | \n", "
121 | \n", "Absolutely not a 10\\nStrong fanboy and hype r... | \n", "P | \n", "
122 | \n", "Overhyped, but it's alright\\nIt's a good film... | \n", "P | \n", "
246 rows × 2 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "
---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "
3 | \n", "What idiotic FIlm\\nI can say that Phoenix is ... | \n", "N | \n", "[ What idiotic FIlm\\nI can say that Phoenix is... | \n", "4 | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "66 | \n", "
4 | \n", "Terrible\\nThe only thing good about this movi... | \n", "N | \n", "[ Terrible\\nThe only thing good about this mov... | \n", "9 | \n", "[terrible, the, only, thing, good, about, this... | \n", "124 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
118 | \n", "Nerve-wracking, but in very uncomfortable way... | \n", "P | \n", "[ Nerve-wracking, but in very uncomfortable wa... | \n", "8 | \n", "[but, in, very, uncomfortable, way, why, every... | \n", "57 | \n", "
119 | \n", "Solid film but there are glaring problems\\nOk... | \n", "P | \n", "[ Solid film but there are glaring problems\\nO... | \n", "13 | \n", "[solid, film, but, there, are, glaring, proble... | \n", "628 | \n", "
120 | \n", "Joker > Endgame\\nNeed I say more? Everything ... | \n", "P | \n", "[ Joker > Endgame\\nNeed I say more?, Everythin... | \n", "5 | \n", "[joker, endgame, need, i, say, more, everythin... | \n", "83 | \n", "
121 | \n", "Absolutely not a 10\\nStrong fanboy and hype r... | \n", "P | \n", "[ Absolutely not a 10\\nStrong fanboy and hype ... | \n", "5 | \n", "[absolutely, not, a, strong, fanboy, and, hype... | \n", "81 | \n", "
122 | \n", "Overhyped, but it's alright\\nIt's a good film... | \n", "P | \n", "[ Overhyped, but it's alright\\nIt's a good fil... | \n", "3 | \n", "[overhyped, but, it, alright, it, a, good, fil... | \n", "60 | \n", "
246 rows × 6 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "
---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "
3 | \n", "What idiotic FIlm\\nI can say that Phoenix is ... | \n", "N | \n", "[ What idiotic FIlm\\nI can say that Phoenix is... | \n", "4 | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "66 | \n", "[idiotic, film, say, phoenix, master, actor, b... | \n", "36 | \n", "
4 | \n", "Terrible\\nThe only thing good about this movi... | \n", "N | \n", "[ Terrible\\nThe only thing good about this mov... | \n", "9 | \n", "[terrible, the, only, thing, good, about, this... | \n", "124 | \n", "[terrible, thing, good, movie, phoenixs, actin... | \n", "65 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
118 | \n", "Nerve-wracking, but in very uncomfortable way... | \n", "P | \n", "[ Nerve-wracking, but in very uncomfortable wa... | \n", "8 | \n", "[but, in, very, uncomfortable, way, why, every... | \n", "57 | \n", "[uncomfortable, way, everybody, keep, saying, ... | \n", "33 | \n", "
119 | \n", "Solid film but there are glaring problems\\nOk... | \n", "P | \n", "[ Solid film but there are glaring problems\\nO... | \n", "13 | \n", "[solid, film, but, there, are, glaring, proble... | \n", "628 | \n", "[solid, film, glaring, problems, okay, first, ... | \n", "292 | \n", "
120 | \n", "Joker > Endgame\\nNeed I say more? Everything ... | \n", "P | \n", "[ Joker > Endgame\\nNeed I say more?, Everythin... | \n", "5 | \n", "[joker, endgame, need, i, say, more, everythin... | \n", "83 | \n", "[joker, endgame, need, say, everything, movie,... | \n", "53 | \n", "
121 | \n", "Absolutely not a 10\\nStrong fanboy and hype r... | \n", "P | \n", "[ Absolutely not a 10\\nStrong fanboy and hype ... | \n", "5 | \n", "[absolutely, not, a, strong, fanboy, and, hype... | \n", "81 | \n", "[absolutely, strong, fanboy, hype, rush, going... | \n", "36 | \n", "
122 | \n", "Overhyped, but it's alright\\nIt's a good film... | \n", "P | \n", "[ Overhyped, but it's alright\\nIt's a good fil... | \n", "3 | \n", "[overhyped, but, it, alright, it, a, good, fil... | \n", "60 | \n", "[overhyped, alright, good, film, see, like, ma... | \n", "31 | \n", "
246 rows × 8 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "topwords_unfil | \n", "topwords_fil | \n", "freq_dist | \n", "freq_dist_unfil | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "[(of, 13), (i, 12), (the, 12), (that, 10), (it... | \n", "[(movie, 3), (said, 3), (many, 3), (times, 3),... | \n", "{'missed': 2, 'opportunity': 2, 'excited': 1, ... | \n", "{'missed': 2, 'opportunity': 2, 'i': 12, 'had'... | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "[(was, 4), (a, 3), (that, 3), (for, 2), (there... | \n", "[(dark, 2), (phoenix, 1), (think, 1), (need, 1... | \n", "{'phoenix': 1, 'think': 1, 'need': 1, 'super':... | \n", "{'for': 2, 'phoenix': 1, 'i': 1, 'do': 1, 'thi... | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "{'everyone': 1, 'praised': 1, 'overrated': 2, ... | \n", "{'everyone': 1, 'praised': 1, 'an': 1, 'overra... | \n", "
3 | \n", "What idiotic FIlm\\nI can say that Phoenix is ... | \n", "N | \n", "[ What idiotic FIlm\\nI can say that Phoenix is... | \n", "4 | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "66 | \n", "[idiotic, film, say, phoenix, master, actor, b... | \n", "36 | \n", "[(and, 4), (is, 2), (make, 2), (movie, 2), (to... | \n", "[(make, 2), (movie, 2), (idiotic, 1), (film, 1... | \n", "{'idiotic': 1, 'film': 1, 'say': 1, 'phoenix':... | \n", "{'what': 1, 'idiotic': 1, 'film': 1, 'i': 1, '... | \n", "
4 | \n", "Terrible\\nThe only thing good about this movi... | \n", "N | \n", "[ Terrible\\nThe only thing good about this mov... | \n", "9 | \n", "[terrible, the, only, thing, good, about, this... | \n", "124 | \n", "[terrible, thing, good, movie, phoenixs, actin... | \n", "65 | \n", "[(the, 5), (this, 4), (movie, 4), (it, 4), (to... | \n", "[(movie, 4), (terrible, 3), (acting, 3), (good... | \n", "{'terrible': 3, 'thing': 1, 'good': 2, 'movie'... | \n", "{'terrible': 3, 'the': 5, 'only': 1, 'thing': ... | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
118 | \n", "Nerve-wracking, but in very uncomfortable way... | \n", "P | \n", "[ Nerve-wracking, but in very uncomfortable wa... | \n", "8 | \n", "[but, in, very, uncomfortable, way, why, every... | \n", "57 | \n", "[uncomfortable, way, everybody, keep, saying, ... | \n", "33 | \n", "[(it, 4), (a, 4), (movie, 3), (in, 2), (keep, ... | \n", "[(movie, 3), (keep, 2), (saying, 2), (psycho, ... | \n", "{'uncomfortable': 1, 'way': 1, 'everybody': 1,... | \n", "{'but': 1, 'in': 2, 'very': 1, 'uncomfortable'... | \n", "
119 | \n", "Solid film but there are glaring problems\\nOk... | \n", "P | \n", "[ Solid film but there are glaring problems\\nO... | \n", "13 | \n", "[solid, film, but, there, are, glaring, proble... | \n", "628 | \n", "[solid, film, glaring, problems, okay, first, ... | \n", "292 | \n", "[(the, 35), (to, 22), (it, 16), (and, 16), (i,... | \n", "[(joker, 6), (movie, 5), (film, 4), (like, 4),... | \n", "{'solid': 1, 'film': 4, 'glaring': 1, 'problem... | \n", "{'solid': 1, 'film': 4, 'but': 5, 'there': 3, ... | \n", "
120 | \n", "Joker > Endgame\\nNeed I say more? Everything ... | \n", "P | \n", "[ Joker > Endgame\\nNeed I say more?, Everythin... | \n", "5 | \n", "[joker, endgame, need, i, say, more, everythin... | \n", "83 | \n", "[joker, endgame, need, say, everything, movie,... | \n", "53 | \n", "[(joker, 3), (movie, 3), (in, 3), (it, 3), (th... | \n", "[(joker, 3), (movie, 3), (masterful, 2), (awes... | \n", "{'joker': 3, 'endgame': 1, 'need': 1, 'say': 1... | \n", "{'joker': 3, 'endgame': 1, 'need': 1, 'i': 1, ... | \n", "
121 | \n", "Absolutely not a 10\\nStrong fanboy and hype r... | \n", "P | \n", "[ Absolutely not a 10\\nStrong fanboy and hype ... | \n", "5 | \n", "[absolutely, not, a, strong, fanboy, and, hype... | \n", "81 | \n", "[absolutely, strong, fanboy, hype, rush, going... | \n", "36 | \n", "[(the, 7), (is, 6), (a, 4), (fanboy, 2), (and,... | \n", "[(fanboy, 2), (movie, 2), (absolutely, 1), (st... | \n", "{'absolutely': 1, 'strong': 1, 'fanboy': 2, 'h... | \n", "{'absolutely': 1, 'not': 1, 'a': 4, 'strong': ... | \n", "
122 | \n", "Overhyped, but it's alright\\nIt's a good film... | \n", "P | \n", "[ Overhyped, but it's alright\\nIt's a good fil... | \n", "3 | \n", "[overhyped, but, it, alright, it, a, good, fil... | \n", "60 | \n", "[overhyped, alright, good, film, see, like, ma... | \n", "31 | \n", "[(it, 4), (but, 3), (a, 3), (good, 2), (do, 2)... | \n", "[(good, 2), (overhyped, 1), (alright, 1), (fil... | \n", "{'overhyped': 1, 'alright': 1, 'good': 2, 'fil... | \n", "{'overhyped': 1, 'but': 3, 'it': 4, 'alright':... | \n", "
246 rows × 12 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "topwords_unfil | \n", "topwords_fil | \n", "freq_dist | \n", "freq_dist_unfil | \n", "vader_all | \n", "v_compound | \n", "v_neg | \n", "v_neu | \n", "v_pos | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "[(of, 13), (i, 12), (the, 12), (that, 10), (it... | \n", "[(movie, 3), (said, 3), (many, 3), (times, 3),... | \n", "{'missed': 2, 'opportunity': 2, 'excited': 1, ... | \n", "{'missed': 2, 'opportunity': 2, 'i': 12, 'had'... | \n", "{'neg': 0.068, 'neu': 0.836, 'pos': 0.096, 'co... | \n", "0.7501 | \n", "0.068 | \n", "0.836 | \n", "0.096 | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "[(was, 4), (a, 3), (that, 3), (for, 2), (there... | \n", "[(dark, 2), (phoenix, 1), (think, 1), (need, 1... | \n", "{'phoenix': 1, 'think': 1, 'need': 1, 'super':... | \n", "{'for': 2, 'phoenix': 1, 'i': 1, 'do': 1, 'thi... | \n", "{'neg': 0.069, 'neu': 0.77, 'pos': 0.16, 'comp... | \n", "0.7184 | \n", "0.069 | \n", "0.770 | \n", "0.160 | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "{'everyone': 1, 'praised': 1, 'overrated': 2, ... | \n", "{'everyone': 1, 'praised': 1, 'an': 1, 'overra... | \n", "{'neg': 0.0, 'neu': 0.79, 'pos': 0.21, 'compou... | \n", "0.7269 | \n", "0.000 | \n", "0.790 | \n", "0.210 | \n", "
3 | \n", "What idiotic FIlm\\nI can say that Phoenix is ... | \n", "N | \n", "[ What idiotic FIlm\\nI can say that Phoenix is... | \n", "4 | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "66 | \n", "[idiotic, film, say, phoenix, master, actor, b... | \n", "36 | \n", "[(and, 4), (is, 2), (make, 2), (movie, 2), (to... | \n", "[(make, 2), (movie, 2), (idiotic, 1), (film, 1... | \n", "{'idiotic': 1, 'film': 1, 'say': 1, 'phoenix':... | \n", "{'what': 1, 'idiotic': 1, 'film': 1, 'i': 1, '... | \n", "{'neg': 0.138, 'neu': 0.77, 'pos': 0.092, 'com... | \n", "-0.6698 | \n", "0.138 | \n", "0.770 | \n", "0.092 | \n", "
4 | \n", "Terrible\\nThe only thing good about this movi... | \n", "N | \n", "[ Terrible\\nThe only thing good about this mov... | \n", "9 | \n", "[terrible, the, only, thing, good, about, this... | \n", "124 | \n", "[terrible, thing, good, movie, phoenixs, actin... | \n", "65 | \n", "[(the, 5), (this, 4), (movie, 4), (it, 4), (to... | \n", "[(movie, 4), (terrible, 3), (acting, 3), (good... | \n", "{'terrible': 3, 'thing': 1, 'good': 2, 'movie'... | \n", "{'terrible': 3, 'the': 5, 'only': 1, 'thing': ... | \n", "{'neg': 0.086, 'neu': 0.778, 'pos': 0.136, 'co... | \n", "0.7184 | \n", "0.086 | \n", "0.778 | \n", "0.136 | \n", "
\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "topwords_unfil | \n", "topwords_fil | \n", "... | \n", "v_compound_sum | \n", "v_neg_sum | \n", "v_neu_sum | \n", "v_pos_sum | \n", "v_freq_words | \n", "vader_fq_all | \n", "v_compound_fd | \n", "v_neg_fd | \n", "v_neu_fd | \n", "v_pos_fd | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "[(of, 13), (i, 12), (the, 12), (that, 10), (it... | \n", "[(movie, 3), (said, 3), (many, 3), (times, 3),... | \n", "... | \n", "0.0000 | \n", "0.000 | \n", "0.000 | \n", "0.000 | \n", "movie said many times missed opportunity see t... | \n", "{'neg': 0.181, 'neu': 0.633, 'pos': 0.187, 'co... | \n", "-0.1531 | \n", "0.181 | \n", "0.633 | \n", "0.187 | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "[(was, 4), (a, 3), (that, 3), (for, 2), (there... | \n", "[(dark, 2), (phoenix, 1), (think, 1), (need, 1... | \n", "... | \n", "0.4019 | \n", "0.066 | \n", "0.785 | \n", "0.148 | \n", "dark phoenix think need super film tbh dc comi... | \n", "{'neg': 0.075, 'neu': 0.594, 'pos': 0.331, 'co... | \n", "0.8020 | \n", "0.075 | \n", "0.594 | \n", "0.331 | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "... | \n", "0.7269 | \n", "0.000 | \n", "0.790 | \n", "0.210 | \n", "overrated movie everyone praised time reviews ... | \n", "{'neg': 0.0, 'neu': 0.596, 'pos': 0.404, 'comp... | \n", "0.7269 | \n", "0.000 | \n", "0.596 | \n", "0.404 | \n", "
3 | \n", "What idiotic FIlm\\nI can say that Phoenix is ... | \n", "N | \n", "[ What idiotic FIlm\\nI can say that Phoenix is... | \n", "4 | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "66 | \n", "[idiotic, film, say, phoenix, master, actor, b... | \n", "36 | \n", "[(and, 4), (is, 2), (make, 2), (movie, 2), (to... | \n", "[(make, 2), (movie, 2), (idiotic, 1), (film, 1... | \n", "... | \n", "-0.6591 | \n", "0.175 | \n", "0.762 | \n", "0.063 | \n", "make movie idiotic film say phoenix master act... | \n", "{'neg': 0.184, 'neu': 0.609, 'pos': 0.207, 'co... | \n", "0.2570 | \n", "0.184 | \n", "0.609 | \n", "0.207 | \n", "
4 | \n", "Terrible\\nThe only thing good about this movi... | \n", "N | \n", "[ Terrible\\nThe only thing good about this mov... | \n", "9 | \n", "[terrible, the, only, thing, good, about, this... | \n", "124 | \n", "[terrible, thing, good, movie, phoenixs, actin... | \n", "65 | \n", "[(the, 5), (this, 4), (movie, 4), (it, 4), (to... | \n", "[(movie, 4), (terrible, 3), (acting, 3), (good... | \n", "... | \n", "0.7311 | \n", "0.071 | \n", "0.779 | \n", "0.150 | \n", "movie terrible acting good dont movies plot ma... | \n", "{'neg': 0.193, 'neu': 0.58, 'pos': 0.227, 'com... | \n", "0.3261 | \n", "0.193 | \n", "0.580 | \n", "0.227 | \n", "
5 rows × 31 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "topwords_unfil | \n", "topwords_fil | \n", "... | \n", "v_freq_words_unfil | \n", "vader_fd_all_unfil | \n", "v_compound_fd_uf | \n", "v_neg_fd_uf | \n", "v_neu_fd_uf | \n", "v_pos_fd_uf | \n", "nltk_negs | \n", "unigram_feats | \n", "bigram_feats | \n", "bigram_feats_neg | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "[(of, 13), (i, 12), (the, 12), (that, 10), (it... | \n", "[(movie, 3), (said, 3), (many, 3), (times, 3),... | \n", "... | \n", "of i the that it a and to was had been very mo... | \n", "{'neg': 0.041, 'neu': 0.816, 'pos': 0.143, 'co... | \n", "0.6705 | \n", "0.041 | \n", "0.816 | \n", "0.143 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "[of, i, the, that, it, a, and, to, was, had, b... | \n", "[missed_opportunity, opportunity_i, i_had, had... | \n", "[missed_opportunity, opportunity_i, i_had, had... | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "[(was, 4), (a, 3), (that, 3), (for, 2), (there... | \n", "[(dark, 2), (phoenix, 1), (think, 1), (need, 1... | \n", "... | \n", "was a that for there dark in it phoenix i do t... | \n", "{'neg': 0.097, 'neu': 0.732, 'pos': 0.171, 'co... | \n", "0.4427 | \n", "0.097 | \n", "0.732 | \n", "0.171 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "[a, for, was, dark, was_NEG, that_NEG, phoenix... | \n", "[for_phoenix, phoenix_i, i_do, do_think, think... | \n", "[for_phoenix, phoenix_i, i_do, do_think, think... | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "... | \n", "overrated movie everyone praised an of all tim... | \n", "{'neg': 0.0, 'neu': 0.783, 'pos': 0.217, 'comp... | \n", "0.7269 | \n", "0.000 | \n", "0.783 | \n", "0.217 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "[overrated, movie, everyone, praised, an, of, ... | \n", "[everyone_praised, praised_an, an_overrated, o... | \n", "[everyone_praised, praised_an, an_overrated, o... | \n", "
3 | \n", "What idiotic FIlm\\nI can say that Phoenix is ... | \n", "N | \n", "[ What idiotic FIlm\\nI can say that Phoenix is... | \n", "4 | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "66 | \n", "[idiotic, film, say, phoenix, master, actor, b... | \n", "36 | \n", "[(and, 4), (is, 2), (make, 2), (movie, 2), (to... | \n", "[(make, 2), (movie, 2), (idiotic, 1), (film, 1... | \n", "... | \n", "and is make movie to with the what idiotic fil... | \n", "{'neg': 0.208, 'neu': 0.741, 'pos': 0.051, 'co... | \n", "-0.8344 | \n", "0.208 | \n", "0.741 | \n", "0.051 | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "[and_NEG, make_NEG, movie_NEG, to_NEG, with_NE... | \n", "[what_idiotic, idiotic_film, film_i, i_can, ca... | \n", "[what_idiotic, idiotic_film, film_i, i_can, ca... | \n", "
4 | \n", "Terrible\\nThe only thing good about this movi... | \n", "N | \n", "[ Terrible\\nThe only thing good about this mov... | \n", "9 | \n", "[terrible, the, only, thing, good, about, this... | \n", "124 | \n", "[terrible, thing, good, movie, phoenixs, actin... | \n", "65 | \n", "[(the, 5), (this, 4), (movie, 4), (it, 4), (to... | \n", "[(movie, 4), (terrible, 3), (acting, 3), (good... | \n", "... | \n", "the this movie it to terrible acting but and f... | \n", "{'neg': 0.181, 'neu': 0.703, 'pos': 0.116, 'co... | \n", "-0.5853 | \n", "0.181 | \n", "0.703 | \n", "0.116 | \n", "[terrible, the, only, thing, good, about, this... | \n", "[it_NEG, the_NEG, to_NEG, and_NEG, for_NEG, th... | \n", "[terrible_the, the_only, only_thing, thing_goo... | \n", "[terrible_the, the_only, only_thing, thing_goo... | \n", "
5 rows × 41 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "topwords_unfil | \n", "topwords_fil | \n", "... | \n", "vader_fd_all_unfil | \n", "v_compound_fd_uf | \n", "v_neg_fd_uf | \n", "v_neu_fd_uf | \n", "v_pos_fd_uf | \n", "nltk_negs | \n", "unigram_feats | \n", "bigram_feats | \n", "bigram_feats_neg | \n", "nltk_all | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "[(of, 13), (i, 12), (the, 12), (that, 10), (it... | \n", "[(movie, 3), (said, 3), (many, 3), (times, 3),... | \n", "... | \n", "{'neg': 0.041, 'neu': 0.816, 'pos': 0.143, 'co... | \n", "0.6705 | \n", "0.041 | \n", "0.816 | \n", "0.143 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "[of, i, the, that, it, a, and, to, was, had, b... | \n", "[missed_opportunity, opportunity_i, i_had, had... | \n", "[missed_opportunity, opportunity_i, i_had, had... | \n", "0 | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "[(was, 4), (a, 3), (that, 3), (for, 2), (there... | \n", "[(dark, 2), (phoenix, 1), (think, 1), (need, 1... | \n", "... | \n", "{'neg': 0.097, 'neu': 0.732, 'pos': 0.171, 'co... | \n", "0.4427 | \n", "0.097 | \n", "0.732 | \n", "0.171 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "[a, for, was, dark, was_NEG, that_NEG, phoenix... | \n", "[for_phoenix, phoenix_i, i_do, do_think, think... | \n", "[for_phoenix, phoenix_i, i_do, do_think, think... | \n", "0 | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "... | \n", "{'neg': 0.0, 'neu': 0.783, 'pos': 0.217, 'comp... | \n", "0.7269 | \n", "0.000 | \n", "0.783 | \n", "0.217 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "[overrated, movie, everyone, praised, an, of, ... | \n", "[everyone_praised, praised_an, an_overrated, o... | \n", "[everyone_praised, praised_an, an_overrated, o... | \n", "0 | \n", "
3 rows × 42 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "topwords_unfil | \n", "topwords_fil | \n", "... | \n", "v_compound_fd_uf | \n", "v_neg_fd_uf | \n", "v_neu_fd_uf | \n", "v_pos_fd_uf | \n", "nltk_negs | \n", "unigram_feats | \n", "bigram_feats | \n", "bigram_feats_neg | \n", "nltk_all | \n", "bow_nosw | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "[(of, 13), (i, 12), (the, 12), (that, 10), (it... | \n", "[(movie, 3), (said, 3), (many, 3), (times, 3),... | \n", "... | \n", "0.6705 | \n", "0.041 | \n", "0.816 | \n", "0.143 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "[of, i, the, that, it, a, and, to, was, had, b... | \n", "[missed_opportunity, opportunity_i, i_had, had... | \n", "[missed_opportunity, opportunity_i, i_had, had... | \n", "0 | \n", "{'Missed': 1, 'Opportunity': 1, 'I': 14, 'had'... | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "[(was, 4), (a, 3), (that, 3), (for, 2), (there... | \n", "[(dark, 2), (phoenix, 1), (think, 1), (need, 1... | \n", "... | \n", "0.4427 | \n", "0.097 | \n", "0.732 | \n", "0.171 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "[a, for, was, dark, was_NEG, that_NEG, phoenix... | \n", "[for_phoenix, phoenix_i, i_do, do_think, think... | \n", "[for_phoenix, phoenix_i, i_do, do_think, think... | \n", "0 | \n", "{'5/5': 1, 'for': 2, 'Phoenix's': 1, 'acting':... | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "... | \n", "0.7269 | \n", "0.000 | \n", "0.783 | \n", "0.217 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "[overrated, movie, everyone, praised, an, of, ... | \n", "[everyone_praised, praised_an, an_overrated, o... | \n", "[everyone_praised, praised_an, an_overrated, o... | \n", "0 | \n", "{'Everyone': 1, 'praised': 1, 'an': 1, 'overra... | \n", "
3 rows × 43 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "topwords_unfil | \n", "topwords_fil | \n", "... | \n", "v_neu_fd_uf | \n", "v_pos_fd_uf | \n", "nltk_negs | \n", "unigram_feats | \n", "bigram_feats | \n", "bigram_feats_neg | \n", "nltk_all | \n", "bow_nosw | \n", "diy_cleaner | \n", "bow_v2 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "[(of, 13), (i, 12), (the, 12), (that, 10), (it... | \n", "[(movie, 3), (said, 3), (many, 3), (times, 3),... | \n", "... | \n", "0.816 | \n", "0.143 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "[of, i, the, that, it, a, and, to, was, had, b... | \n", "[missed_opportunity, opportunity_i, i_had, had... | \n", "[missed_opportunity, opportunity_i, i_had, had... | \n", "0 | \n", "{'Missed': 1, 'Opportunity': 1, 'I': 14, 'had'... | \n", "missed opportunity missed opportunity i had ... | \n", "{'missed': 3, 'opportunity': 3, 'i': 14, 'had'... | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "[(was, 4), (a, 3), (that, 3), (for, 2), (there... | \n", "[(dark, 2), (phoenix, 1), (think, 1), (need, 1... | \n", "... | \n", "0.732 | \n", "0.171 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "[a, for, was, dark, was_NEG, that_NEG, phoenix... | \n", "[for_phoenix, phoenix_i, i_do, do_think, think... | \n", "[for_phoenix, phoenix_i, i_do, do_think, think... | \n", "0 | \n", "{'5/5': 1, 'for': 2, 'Phoenix's': 1, 'acting':... | \n", "5/5 for phoenix's acting.. 5/5 for phoenix's... | \n", "{'5/5': 2, 'for': 3, 'phoenix's': 2, 'acting':... | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "... | \n", "0.783 | \n", "0.217 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "[overrated, movie, everyone, praised, an, of, ... | \n", "[everyone_praised, praised_an, an_overrated, o... | \n", "[everyone_praised, praised_an, an_overrated, o... | \n", "0 | \n", "{'Everyone': 1, 'praised': 1, 'an': 1, 'overra... | \n", "everyone praised an overrated movie. everyon... | \n", "{'everyone': 2, 'praised': 2, 'an': 2, 'overra... | \n", "
3 rows × 45 columns
\n", "\n", " | missed | \n", "opportunity | \n", "been | \n", "very | \n", "excited | \n", "this | \n", "movie | \n", "ever | \n", "since | \n", "heard | \n", "... | \n", "information | \n", "follows | \n", "stress | \n", "overlook | \n", "offers | \n", "easy | \n", "answers | \n", "alike | \n", "company | \n", "acceptable | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
PoN | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
N | \n", "3 | \n", "3 | \n", "4 | \n", "3 | \n", "1 | \n", "2 | \n", "3 | \n", "1 | \n", "1 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "1 | \n", "3 | \n", "0 | \n", "0 | \n", "0 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "2 | \n", "2 | \n", "0 | \n", "0 | \n", "0 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "4 | \n", "4 | \n", "0 | \n", "0 | \n", "0 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
5 rows × 4249 columns
\n", "\n", " | missed | \n", "opportunity | \n", "i | \n", "had | \n", "been | \n", "very | \n", "excited | \n", "to | \n", "see | \n", "this | \n", "... | \n", "stress | \n", "overlook | \n", "> | \n", "offers | \n", "easy | \n", "answers | \n", "alike | \n", "company | \n", "til | \n", "acceptable | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
PoN | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
N | \n", "3 | \n", "3 | \n", "14 | \n", "4 | \n", "4 | \n", "3 | \n", "1 | \n", "6 | \n", "2 | \n", "2 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "1 | \n", "1 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "2 | \n", "0 | \n", "2 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "2 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "4 | \n", "0 | \n", "4 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
5 rows × 4530 columns
\n", "\n", " | Missed | \n", "Opportunity | \n", "I | \n", "had | \n", "been | \n", "very | \n", "excited | \n", "to | \n", "see | \n", "this | \n", "... | \n", "Strong | \n", "iq | \n", "Deff | \n", "comparable | \n", "knight | \n", "compensate | \n", "company | \n", "up.Movie | \n", "til | \n", "acceptable | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
PoN | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
N | \n", "1 | \n", "1 | \n", "14 | \n", "4 | \n", "4 | \n", "3 | \n", "1 | \n", "6 | \n", "2 | \n", "2 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "1 | \n", "1 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "2 | \n", "0 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "4 | \n", "0 | \n", "4 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
5 rows × 6168 columns
\n", "\n", " | missed | \n", "opportunity | \n", "i | \n", "had | \n", "been | \n", "very | \n", "excited | \n", "to | \n", "see | \n", "this | \n", "... | \n", "paced_NEG | \n", "annoying_NEG | \n", "deff_NEG | \n", "comparable_NEG | \n", "compensate_NEG | \n", "marketing | \n", "company | \n", "climax | \n", "strange | \n", "acceptable | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
PoN | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
N | \n", "2 | \n", "2 | \n", "12 | \n", "4 | \n", "4 | \n", "3 | \n", "1 | \n", "6 | \n", "2 | \n", "2 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
5 rows × 6406 columns
\n", "\n", " | missed_opportunity | \n", "opportunity_i | \n", "i_had | \n", "had_been | \n", "been_very | \n", "very_excited | \n", "excited_to | \n", "to_see | \n", "see_this | \n", "this_movie | \n", "... | \n", "slow_do | \n", "happen_the | \n", "middle_and | \n", "climax_cinematography | \n", "joker_acting | \n", "fine_if | \n", "forced_at | \n", "just_strange | \n", "strange_but | \n", "but_acceptable | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
PoN | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
N | \n", "2 | \n", "1 | \n", "3 | \n", "2 | \n", "1 | \n", "1 | \n", "1 | \n", "1 | \n", "1 | \n", "2 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "4 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
5 rows × 24658 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "topwords_unfil | \n", "topwords_fil | \n", "... | \n", "bigram_feats_neg | \n", "nltk_all | \n", "bow_nosw | \n", "diy_cleaner | \n", "bow_v2 | \n", "pruned | \n", "bow_v3 | \n", "bow_v4 | \n", "bow_v5 | \n", "no_shared_words | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "[(of, 13), (i, 12), (the, 12), (that, 10), (it... | \n", "[(movie, 3), (said, 3), (many, 3), (times, 3),... | \n", "... | \n", "[missed_opportunity, opportunity_i, i_had, had... | \n", "0 | \n", "{'Missed': 1, 'Opportunity': 1, 'I': 14, 'had'... | \n", "missed opportunity missed opportunity i had ... | \n", "{'missed': 3, 'opportunity': 3, 'i': 14, 'had'... | \n", "missed opportunity missed opportunity been ver... | \n", "{'missed': 3, 'opportunity': 3, 'been': 4, 've... | \n", "{'missed': 2, 'opportunity': 2, 'i': 12, 'had'... | \n", "{'missed_opportunity': 2, 'opportunity_i': 1, ... | \n", "[missed, opportunity, i, had, excited, to, see... | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "[(was, 4), (a, 3), (that, 3), (for, 2), (there... | \n", "[(dark, 2), (phoenix, 1), (think, 1), (need, 1... | \n", "... | \n", "[for_phoenix, phoenix_i, i_do, do_think, think... | \n", "0 | \n", "{'5/5': 1, 'for': 2, 'Phoenix's': 1, 'acting':... | \n", "5/5 for phoenix's acting.. 5/5 for phoenix's... | \n", "{'5/5': 2, 'for': 3, 'phoenix's': 2, 'acting':... | \n", "phoenix's acting.. phoenix's acting.. dont thi... | \n", "{'phoenix's': 2, 'acting': 2, '..': 2, 'dont':... | \n", "{'for': 2, 'phoenix': 1, 'i': 1, 'do': 1, 'thi... | \n", "{'for_phoenix': 1, 'phoenix_i': 1, 'i_do': 1, ... | \n", "[for, i, do, was, a, for, a, tbh, is, a, dc, h... | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "... | \n", "[everyone_praised, praised_an, an_overrated, o... | \n", "0 | \n", "{'Everyone': 1, 'praised': 1, 'an': 1, 'overra... | \n", "everyone praised an overrated movie. everyon... | \n", "{'everyone': 2, 'praised': 2, 'an': 2, 'overra... | \n", "everyone praised overrated movie. everyone pra... | \n", "{'everyone': 2, 'praised': 2, 'overrated': 3, ... | \n", "{'everyone': 1, 'praised': 1, 'an': 1, 'overra... | \n", "{'everyone_praised': 1, 'praised_an': 1, 'an_o... | \n", "[praised, an, of, all, the, are, out, to, be, ... | \n", "
3 | \n", "What idiotic FIlm\\nI can say that Phoenix is ... | \n", "N | \n", "[ What idiotic FIlm\\nI can say that Phoenix is... | \n", "4 | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "66 | \n", "[idiotic, film, say, phoenix, master, actor, b... | \n", "36 | \n", "[(and, 4), (is, 2), (make, 2), (movie, 2), (to... | \n", "[(make, 2), (movie, 2), (idiotic, 1), (film, 1... | \n", "... | \n", "[what_idiotic, idiotic_film, film_i, i_can, ca... | \n", "0 | \n", "{'What': 1, 'idiotic': 1, 'FIlm': 1, 'I': 1, '... | \n", "what idiotic film what idiotic film i can sa... | \n", "{'what': 2, 'idiotic': 2, 'film': 2, 'i': 1, '... | \n", "what idiotic film what idiotic film that phoen... | \n", "{'what': 2, 'idiotic': 2, 'film': 2, 'that': 1... | \n", "{'what': 1, 'idiotic': 1, 'film': 1, 'i': 1, '... | \n", "{'what_idiotic': 1, 'idiotic_film': 1, 'film_i... | \n", "[idiotic, i, can, say, is, master, bt, not, a,... | \n", "
4 | \n", "Terrible\\nThe only thing good about this movi... | \n", "N | \n", "[ Terrible\\nThe only thing good about this mov... | \n", "9 | \n", "[terrible, the, only, thing, good, about, this... | \n", "124 | \n", "[terrible, thing, good, movie, phoenixs, actin... | \n", "65 | \n", "[(the, 5), (this, 4), (movie, 4), (it, 4), (to... | \n", "[(movie, 4), (terrible, 3), (acting, 3), (good... | \n", "... | \n", "[terrible_the, the_only, only_thing, thing_goo... | \n", "0 | \n", "{'Terrible': 1, 'The': 2, 'only': 1, 'thing': ... | \n", "terrible terrible the only thing good about ... | \n", "{'terrible': 4, 'the': 5, 'only': 1, 'thing': ... | \n", "terrible terrible only thing good about this m... | \n", "{'terrible': 4, 'only': 1, 'thing': 1, 'good':... | \n", "{'terrible': 1, 'the': 1, 'only': 1, 'thing': ... | \n", "{'terrible_the': 1, 'the_only': 1, 'only_thing... | \n", "[terrible, the, is, but, i, and, for, entertai... | \n", "
5 rows × 50 columns
\n", "" ], "text/plain": [ " 0 PoN \\\n", "0 Missed Opportunity\\nI had been very excited t... N \n", "1 5/5 for Phoenix's acting..\\nI don't think the... N \n", "2 Everyone praised an overrated movie.\\nOverrat... N \n", "3 What idiotic FIlm\\nI can say that Phoenix is ... N \n", "4 Terrible\\nThe only thing good about this movi... N \n", "\n", " sentences num_sentences \\\n", "0 [ Missed Opportunity\\nI had been very excited ... 1 \n", "1 [ 5/5 for Phoenix's acting.., I don't think th... 5 \n", "2 [ Everyone praised an overrated movie., Overra... 2 \n", "3 [ What idiotic FIlm\\nI can say that Phoenix is... 4 \n", "4 [ Terrible\\nThe only thing good about this mov... 9 \n", "\n", " tokens num_tokens \\\n", "0 [missed, opportunity, i, had, been, very, exci... 306 \n", "1 [for, phoenix, i, do, think, there, was, a, ne... 59 \n", "2 [everyone, praised, an, overrated, movie, over... 26 \n", "3 [what, idiotic, film, i, can, say, that, phoen... 66 \n", "4 [terrible, the, only, thing, good, about, this... 124 \n", "\n", " no_sw num_no_sw \\\n", "0 [missed, opportunity, excited, see, movie, eve... 140 \n", "1 [phoenix, think, need, super, dark, film, tbh,... 25 \n", "2 [everyone, praised, overrated, movie, overrate... 13 \n", "3 [idiotic, film, say, phoenix, master, actor, b... 36 \n", "4 [terrible, thing, good, movie, phoenixs, actin... 65 \n", "\n", " topwords_unfil \\\n", "0 [(of, 13), (i, 12), (the, 12), (that, 10), (it... \n", "1 [(was, 4), (a, 3), (that, 3), (for, 2), (there... \n", "2 [(overrated, 2), (movie, 2), (everyone, 1), (p... \n", "3 [(and, 4), (is, 2), (make, 2), (movie, 2), (to... \n", "4 [(the, 5), (this, 4), (movie, 4), (it, 4), (to... \n", "\n", " topwords_fil ... \\\n", "0 [(movie, 3), (said, 3), (many, 3), (times, 3),... ... \n", "1 [(dark, 2), (phoenix, 1), (think, 1), (need, 1... ... \n", "2 [(overrated, 2), (movie, 2), (everyone, 1), (p... ... \n", "3 [(make, 2), (movie, 2), (idiotic, 1), (film, 1... ... \n", "4 [(movie, 4), (terrible, 3), (acting, 3), (good... ... \n", "\n", " bigram_feats_neg nltk_all \\\n", "0 [missed_opportunity, opportunity_i, i_had, had... 0 \n", "1 [for_phoenix, phoenix_i, i_do, do_think, think... 0 \n", "2 [everyone_praised, praised_an, an_overrated, o... 0 \n", "3 [what_idiotic, idiotic_film, film_i, i_can, ca... 0 \n", "4 [terrible_the, the_only, only_thing, thing_goo... 0 \n", "\n", " bow_nosw \\\n", "0 {'Missed': 1, 'Opportunity': 1, 'I': 14, 'had'... \n", "1 {'5/5': 1, 'for': 2, 'Phoenix's': 1, 'acting':... \n", "2 {'Everyone': 1, 'praised': 1, 'an': 1, 'overra... \n", "3 {'What': 1, 'idiotic': 1, 'FIlm': 1, 'I': 1, '... \n", "4 {'Terrible': 1, 'The': 2, 'only': 1, 'thing': ... \n", "\n", " diy_cleaner \\\n", "0 missed opportunity missed opportunity i had ... \n", "1 5/5 for phoenix's acting.. 5/5 for phoenix's... \n", "2 everyone praised an overrated movie. everyon... \n", "3 what idiotic film what idiotic film i can sa... \n", "4 terrible terrible the only thing good about ... \n", "\n", " bow_v2 \\\n", "0 {'missed': 3, 'opportunity': 3, 'i': 14, 'had'... \n", "1 {'5/5': 2, 'for': 3, 'phoenix's': 2, 'acting':... \n", "2 {'everyone': 2, 'praised': 2, 'an': 2, 'overra... \n", "3 {'what': 2, 'idiotic': 2, 'film': 2, 'i': 1, '... \n", "4 {'terrible': 4, 'the': 5, 'only': 1, 'thing': ... \n", "\n", " pruned \\\n", "0 missed opportunity missed opportunity been ver... \n", "1 phoenix's acting.. phoenix's acting.. dont thi... \n", "2 everyone praised overrated movie. everyone pra... \n", "3 what idiotic film what idiotic film that phoen... \n", "4 terrible terrible only thing good about this m... \n", "\n", " bow_v3 \\\n", "0 {'missed': 3, 'opportunity': 3, 'been': 4, 've... \n", "1 {'phoenix's': 2, 'acting': 2, '..': 2, 'dont':... \n", "2 {'everyone': 2, 'praised': 2, 'overrated': 3, ... \n", "3 {'what': 2, 'idiotic': 2, 'film': 2, 'that': 1... \n", "4 {'terrible': 4, 'only': 1, 'thing': 1, 'good':... \n", "\n", " bow_v4 \\\n", "0 {'missed': 2, 'opportunity': 2, 'i': 12, 'had'... \n", "1 {'for': 2, 'phoenix': 1, 'i': 1, 'do': 1, 'thi... \n", "2 {'everyone': 1, 'praised': 1, 'an': 1, 'overra... \n", "3 {'what': 1, 'idiotic': 1, 'film': 1, 'i': 1, '... \n", "4 {'terrible': 1, 'the': 1, 'only': 1, 'thing': ... \n", "\n", " bow_v5 \\\n", "0 {'missed_opportunity': 2, 'opportunity_i': 1, ... \n", "1 {'for_phoenix': 1, 'phoenix_i': 1, 'i_do': 1, ... \n", "2 {'everyone_praised': 1, 'praised_an': 1, 'an_o... \n", "3 {'what_idiotic': 1, 'idiotic_film': 1, 'film_i... \n", "4 {'terrible_the': 1, 'the_only': 1, 'only_thing... \n", "\n", " no_shared_words \n", "0 [missed, opportunity, i, had, excited, to, see... \n", "1 [for, i, do, was, a, for, a, tbh, is, a, dc, h... \n", "2 [praised, an, of, all, the, are, out, to, be, ... \n", "3 [idiotic, i, can, say, is, master, bt, not, a,... \n", "4 [terrible, the, is, but, i, and, for, entertai... \n", "\n", "[5 rows x 50 columns]" ] }, "execution_count": 93, "metadata": {}, "output_type": "execute_result" } ], "source": [ "all_df[:5]" ] }, { "cell_type": "code", "execution_count": 94, "metadata": {}, "outputs": [ { "data": { "text/html": [ "\n", " | missed | \n", "opportunity | \n", "i | \n", "had | \n", "excited | \n", "to | \n", "see | \n", "heard | \n", "it | \n", "and | \n", "... | \n", "overlook | \n", "easy | \n", "answers | \n", "alike | \n", "iq | \n", "deff | \n", "comparable | \n", "compensate | \n", "company | \n", "acceptable | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
PoN | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
N | \n", "2 | \n", "2 | \n", "12 | \n", "4 | \n", "1 | \n", "6 | \n", "2 | \n", "1 | \n", "9 | \n", "7 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "1 | \n", "1 | \n", "0 | \n", "2 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "2 | \n", "0 | \n", "0 | \n", "0 | \n", "4 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "2 | \n", "0 | \n", "0 | \n", "4 | \n", "0 | \n", "0 | \n", "4 | \n", "3 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
5 rows × 4506 columns
\n", "\n", " | missed | \n", "opportunity | \n", "i | \n", "had | \n", "been | \n", "very | \n", "excited | \n", "to | \n", "see | \n", "this | \n", "... | \n", "overlook | \n", "easy | \n", "answers | \n", "alike | \n", "iq | \n", "deff | \n", "comparable | \n", "compensate | \n", "company | \n", "acceptable | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
PoN | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
N | \n", "2 | \n", "2 | \n", "12 | \n", "4 | \n", "4 | \n", "3 | \n", "1 | \n", "6 | \n", "2 | \n", "2 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "1 | \n", "1 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "1 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "2 | \n", "0 | \n", "1 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
N | \n", "0 | \n", "0 | \n", "2 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "4 | \n", "0 | \n", "4 | \n", "... | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "0 | \n", "
5 rows × 4388 columns
\n", "\n", " | 0 | \n", "PoN | \n", "sentences | \n", "num_sentences | \n", "tokens | \n", "num_tokens | \n", "no_sw | \n", "num_no_sw | \n", "topwords_unfil | \n", "topwords_fil | \n", "... | \n", "diy_cleaner | \n", "bow_v2 | \n", "pruned | \n", "bow_v3 | \n", "bow_v4 | \n", "bow_v5 | \n", "no_shared_words | \n", "bow_v6 | \n", "no_neg_words | \n", "bow_v7 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Missed Opportunity\\nI had been very excited t... | \n", "N | \n", "[ Missed Opportunity\\nI had been very excited ... | \n", "1 | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "306 | \n", "[missed, opportunity, excited, see, movie, eve... | \n", "140 | \n", "[(of, 13), (i, 12), (the, 12), (that, 10), (it... | \n", "[(movie, 3), (said, 3), (many, 3), (times, 3),... | \n", "... | \n", "missed opportunity missed opportunity i had ... | \n", "{'missed': 3, 'opportunity': 3, 'i': 14, 'had'... | \n", "missed opportunity missed opportunity been ver... | \n", "{'missed': 3, 'opportunity': 3, 'been': 4, 've... | \n", "{'missed': 2, 'opportunity': 2, 'i': 12, 'had'... | \n", "{'missed_opportunity': 2, 'opportunity_i': 1, ... | \n", "[missed, opportunity, i, had, excited, to, see... | \n", "{'missed': 2, 'opportunity': 2, 'i': 12, 'had'... | \n", "[missed, opportunity, i, had, been, very, exci... | \n", "{'missed': 2, 'opportunity': 2, 'i': 12, 'had'... | \n", "
1 | \n", "5/5 for Phoenix's acting..\\nI don't think the... | \n", "N | \n", "[ 5/5 for Phoenix's acting.., I don't think th... | \n", "5 | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "59 | \n", "[phoenix, think, need, super, dark, film, tbh,... | \n", "25 | \n", "[(was, 4), (a, 3), (that, 3), (for, 2), (there... | \n", "[(dark, 2), (phoenix, 1), (think, 1), (need, 1... | \n", "... | \n", "5/5 for phoenix's acting.. 5/5 for phoenix's... | \n", "{'5/5': 2, 'for': 3, 'phoenix's': 2, 'acting':... | \n", "phoenix's acting.. phoenix's acting.. dont thi... | \n", "{'phoenix's': 2, 'acting': 2, '..': 2, 'dont':... | \n", "{'for': 2, 'phoenix': 1, 'i': 1, 'do': 1, 'thi... | \n", "{'for_phoenix': 1, 'phoenix_i': 1, 'i_do': 1, ... | \n", "[for, i, do, was, a, for, a, tbh, is, a, dc, h... | \n", "{'for': 2, 'i': 1, 'do': 1, 'was': 4, 'a': 3, ... | \n", "[for, phoenix, i, do, think, there, was, a, ne... | \n", "{'for': 2, 'phoenix': 1, 'i': 1, 'do': 1, 'thi... | \n", "
2 | \n", "Everyone praised an overrated movie.\\nOverrat... | \n", "N | \n", "[ Everyone praised an overrated movie., Overra... | \n", "2 | \n", "[everyone, praised, an, overrated, movie, over... | \n", "26 | \n", "[everyone, praised, overrated, movie, overrate... | \n", "13 | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "[(overrated, 2), (movie, 2), (everyone, 1), (p... | \n", "... | \n", "everyone praised an overrated movie. everyon... | \n", "{'everyone': 2, 'praised': 2, 'an': 2, 'overra... | \n", "everyone praised overrated movie. everyone pra... | \n", "{'everyone': 2, 'praised': 2, 'overrated': 3, ... | \n", "{'everyone': 1, 'praised': 1, 'an': 1, 'overra... | \n", "{'everyone_praised': 1, 'praised_an': 1, 'an_o... | \n", "[praised, an, of, all, the, are, out, to, be, ... | \n", "{'praised': 1, 'an': 1, 'of': 1, 'all': 1, 'th... | \n", "[everyone, praised, an, overrated, movie, over... | \n", "{'everyone': 1, 'praised': 1, 'an': 1, 'overra... | \n", "
3 | \n", "What idiotic FIlm\\nI can say that Phoenix is ... | \n", "N | \n", "[ What idiotic FIlm\\nI can say that Phoenix is... | \n", "4 | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "66 | \n", "[idiotic, film, say, phoenix, master, actor, b... | \n", "36 | \n", "[(and, 4), (is, 2), (make, 2), (movie, 2), (to... | \n", "[(make, 2), (movie, 2), (idiotic, 1), (film, 1... | \n", "... | \n", "what idiotic film what idiotic film i can sa... | \n", "{'what': 2, 'idiotic': 2, 'film': 2, 'i': 1, '... | \n", "what idiotic film what idiotic film that phoen... | \n", "{'what': 2, 'idiotic': 2, 'film': 2, 'that': 1... | \n", "{'what': 1, 'idiotic': 1, 'film': 1, 'i': 1, '... | \n", "{'what_idiotic': 1, 'idiotic_film': 1, 'film_i... | \n", "[idiotic, i, can, say, is, master, bt, not, a,... | \n", "{'idiotic': 1, 'i': 1, 'can': 1, 'say': 1, 'is... | \n", "[what, idiotic, film, i, can, say, that, phoen... | \n", "{'what': 1, 'idiotic': 1, 'film': 1, 'i': 1, '... | \n", "
4 | \n", "Terrible\\nThe only thing good about this movi... | \n", "N | \n", "[ Terrible\\nThe only thing good about this mov... | \n", "9 | \n", "[terrible, the, only, thing, good, about, this... | \n", "124 | \n", "[terrible, thing, good, movie, phoenixs, actin... | \n", "65 | \n", "[(the, 5), (this, 4), (movie, 4), (it, 4), (to... | \n", "[(movie, 4), (terrible, 3), (acting, 3), (good... | \n", "... | \n", "terrible terrible the only thing good about ... | \n", "{'terrible': 4, 'the': 5, 'only': 1, 'thing': ... | \n", "terrible terrible only thing good about this m... | \n", "{'terrible': 4, 'only': 1, 'thing': 1, 'good':... | \n", "{'terrible': 1, 'the': 1, 'only': 1, 'thing': ... | \n", "{'terrible_the': 1, 'the_only': 1, 'only_thing... | \n", "[terrible, the, is, but, i, and, for, entertai... | \n", "{'terrible': 3, 'the': 5, 'is': 2, 'but': 3, '... | \n", "[terrible, the, only, thing, good, about, this... | \n", "{'terrible': 3, 'the': 5, 'only': 1, 'thing': ... | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
118 | \n", "Nerve-wracking, but in very uncomfortable way... | \n", "P | \n", "[ Nerve-wracking, but in very uncomfortable wa... | \n", "8 | \n", "[but, in, very, uncomfortable, way, why, every... | \n", "57 | \n", "[uncomfortable, way, everybody, keep, saying, ... | \n", "33 | \n", "[(it, 4), (a, 4), (movie, 3), (in, 2), (keep, ... | \n", "[(movie, 3), (keep, 2), (saying, 2), (psycho, ... | \n", "... | \n", "nerve-wracking, but in very uncomfortable way... | \n", "{'nerve-wracking': 2, ',': 2, 'but': 2, 'in': ... | \n", "nerve-wracking, very uncomfortable nerve-wrack... | \n", "{'nerve-wracking': 2, ',': 2, 'very': 2, 'unco... | \n", "{'but': 1, 'in': 1, 'very': 1, 'uncomfortable'... | \n", "{'but_in': 1, 'in_very': 1, 'very_uncomfortabl... | \n", "[but, in, way, why, everybody, keep, it, a, it... | \n", "{'but': 1, 'in': 2, 'way': 1, 'why': 1, 'every... | \n", "[but, in, very, uncomfortable, way, why, every... | \n", "{'but': 1, 'in': 2, 'very': 1, 'uncomfortable'... | \n", "
119 | \n", "Solid film but there are glaring problems\\nOk... | \n", "P | \n", "[ Solid film but there are glaring problems\\nO... | \n", "13 | \n", "[solid, film, but, there, are, glaring, proble... | \n", "628 | \n", "[solid, film, glaring, problems, okay, first, ... | \n", "292 | \n", "[(the, 35), (to, 22), (it, 16), (and, 16), (i,... | \n", "[(joker, 6), (movie, 5), (film, 4), (like, 4),... | \n", "... | \n", "solid film but there are glaring problems so... | \n", "{'solid': 2, 'film': 5, 'but': 6, 'there': 4, ... | \n", "solid film there glaring problems solid film t... | \n", "{'solid': 2, 'film': 5, 'there': 4, 'glaring':... | \n", "{'solid': 1, 'film': 3, 'but': 4, 'there': 2, ... | \n", "{'solid_film': 1, 'film_but': 1, 'but_there': ... | \n", "[solid, but, are, glaring, problems, okay, i, ... | \n", "{'solid': 1, 'but': 5, 'are': 4, 'glaring': 1,... | \n", "[film, but, there, are, glaring, problems, oka... | \n", "{'film': 4, 'but': 5, 'there': 3, 'are': 4, 'g... | \n", "
120 | \n", "Joker > Endgame\\nNeed I say more? Everything ... | \n", "P | \n", "[ Joker > Endgame\\nNeed I say more?, Everythin... | \n", "5 | \n", "[joker, endgame, need, i, say, more, everythin... | \n", "83 | \n", "[joker, endgame, need, say, everything, movie,... | \n", "53 | \n", "[(joker, 3), (movie, 3), (in, 3), (it, 3), (th... | \n", "[(joker, 3), (movie, 3), (masterful, 2), (awes... | \n", "... | \n", "joker > endgame joker > endgame need i say m... | \n", "{'joker': 4, '>': 2, 'endgame': 2, 'need': 1, ... | \n", "joker endgame joker endgame need more everythi... | \n", "{'joker': 4, 'endgame': 2, 'need': 1, 'more': ... | \n", "{'joker': 2, 'endgame': 1, 'need': 1, 'i': 1, ... | \n", "{'joker_endgame': 1, 'endgame_need': 1, 'need_... | \n", "[endgame, i, say, is, masterful, in, single, w... | \n", "{'endgame': 1, 'i': 1, 'say': 1, 'is': 2, 'mas... | \n", "[joker, need, i, say, more, everything, about,... | \n", "{'joker': 3, 'need': 1, 'i': 1, 'say': 1, 'mor... | \n", "
121 | \n", "Absolutely not a 10\\nStrong fanboy and hype r... | \n", "P | \n", "[ Absolutely not a 10\\nStrong fanboy and hype ... | \n", "5 | \n", "[absolutely, not, a, strong, fanboy, and, hype... | \n", "81 | \n", "[absolutely, strong, fanboy, hype, rush, going... | \n", "36 | \n", "[(the, 7), (is, 6), (a, 4), (fanboy, 2), (and,... | \n", "[(fanboy, 2), (movie, 2), (absolutely, 1), (st... | \n", "... | \n", "absolutely not a 10 absolutely not a 10 stro... | \n", "{'absolutely': 2, 'not': 2, 'a': 2, '10': 2, '... | \n", "absolutely absolutely strong fanboy hype rush ... | \n", "{'absolutely': 2, 'strong': 1, 'fanboy': 1, 'h... | \n", "{'absolutely': 1, 'not': 1, 'a_NEG': 4, 'stron... | \n", "{'absolutely_not': 1, 'not_a': 1, 'a_strong': ... | \n", "[not, a, strong, fanboy, and, rush, on, the, i... | \n", "{'not': 1, 'a': 4, 'strong': 1, 'fanboy': 2, '... | \n", "[absolutely, not, a, fanboy, and, hype, rush, ... | \n", "{'absolutely': 1, 'not': 1, 'a': 4, 'fanboy': ... | \n", "
122 | \n", "Overhyped, but it's alright\\nIt's a good film... | \n", "P | \n", "[ Overhyped, but it's alright\\nIt's a good fil... | \n", "3 | \n", "[overhyped, but, it, alright, it, a, good, fil... | \n", "60 | \n", "[overhyped, alright, good, film, see, like, ma... | \n", "31 | \n", "[(it, 4), (but, 3), (a, 3), (good, 2), (do, 2)... | \n", "[(good, 2), (overhyped, 1), (alright, 1), (fil... | \n", "... | \n", "overhyped, but it's alright overhyped, but i... | \n", "{'overhyped': 2, ',': 2, 'but': 4, 'it's': 2, ... | \n", "overhyped, it's alright overhyped, it's alrigh... | \n", "{'overhyped': 2, ',': 2, 'it's': 2, 'alright':... | \n", "{'overhyped': 1, 'but': 3, 'it': 4, 'alright':... | \n", "{'overhyped_but': 1, 'but_it': 1, 'it_alright'... | \n", "[but, it, alright, it, a, but, i, do, see, as,... | \n", "{'but': 3, 'it': 4, 'alright': 1, 'a': 3, 'i':... | \n", "[overhyped, but, it, alright, it, a, good, fil... | \n", "{'overhyped': 1, 'but': 3, 'it': 4, 'alright':... | \n", "
246 rows × 53 columns
\n", "