How to use Data Science Superpowers for Useless Things: Organizing Ancient Projects, in Python
OK so, let’s go step by step:
- First, we find the external hard drive (duh)
- Then, we find the folder we want to “Flatten and Rename” (also, duh)
- Then, we go about the process of “Flattening and Renaming” the files using our PORTFOLIO_SCRAPING_EHD_RENAME_AND_MOVE file (or the code posted on Daniel daily log 02-13-21)
- Then, once we have the “Flat File” (usually appended with an __ALL), we can go through the process of SORTING the flat file. This uses a different python file. This uses the (currently titled) PORTFOLIO_SCRAPING_EHD_CLEAN file and the associated csv files with extension and parent folder locations
Remind me, why are we doing all of this again?
Because there is A LOT of junk on these external hard drives. And because the current way they are storing the files (ummmm there is no system or convention because I change those faster than I change uniform iterations) does NOT lend itself to easy retrieval. If I had been better at naming, maybe we’d have a different story. I DO have that google colab where I can find all files with X extension or with Y in the name… but this doesn’t help me answer the question “What projects did I work on during the last week of DBC, August 2012?”