Commit graph

18 commits

SHA1 Message Date
804f966a00 Adds stopwords and ngram-range to tfidf-vectorizer 2023-06-28 12:34:22 +02:00
4f4af74259 Adds pickling of tfidf vectorizer 2023-06-28 01:47:56 +02:00
353b9f85cf Removes unused print statement 2023-06-28 00:13:25 +02:00
d38a3002c6 Improves saving of tweets_all_combined 2023-06-28 00:11:58 +02:00
b2b903f45a Applies gitignore changes 2023-06-27 23:52:20 +02:00
8fcbb40b2a Adds pickled tfidf matrix to gitignore 2023-06-27 23:51:43 +02:00
d1348391f9 Adds pickling of tfidf and saving relevance scores 2023-06-27 23:50:26 +02:00
12cfbc5222 Adds basic code for searchable tf-idf matrix 2023-06-23 19:19:19 +02:00
2684b4b8c7 Adds requirements.txt 2023-06-18 15:03:41 +02:00
2c50cc9b72 Adds script for general dataset analysis 2023-06-17 16:54:56 +02:00
8647f32ca8 Adds env/ to gitignore 2023-06-17 12:21:03 +02:00
5ccecf9a90 Refactors repo structure 2023-06-17 12:17:15 +02:00
9450ec940b Update 'README.md' 2023-03-28 15:39:31 +02:00
36de4fdf81 finished tidying data 2023-03-28 15:35:03 +02:00
880c989c03 added readme 2023-03-28 15:34:38 +02:00
abe05ce248 unifying handles and usernames 2023-03-27 21:30:05 +02:00
5816077c16 added old data and began writing code for merch 2023-03-26 21:50:55 +02:00
8d3c8b3974 init 2023-03-26 18:36:49 +02:00