patient_conversation-positive.txt, patient_conversation-negative.txt and patient_conversations-test.txt are opened in append ("a") mode instead of write ("w") mode. Because the files persist on disk, every re-run of the cell in the Jupyter notebook appends the same records again, so the data ends up duplicated.
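
One possible fix is to open each file in write mode so that a re-run overwrites the previous output. The sketch below is illustrative only: `write_conversations` and the sample records are hypothetical names and data, not taken from the notebook, and it assumes each record is written as one line of text.

```python
import tempfile
from pathlib import Path


def write_conversations(path, lines):
    """Overwrite `path` with `lines`, one record per line.

    Hypothetical helper: "w" truncates the file on open, so calling this
    repeatedly leaves exactly one copy of the data.
    """
    with open(path, "w", encoding="utf-8") as f:
        f.writelines(line + "\n" for line in lines)


# Simulate running the notebook cell twice against the same file.
with tempfile.TemporaryDirectory() as tmp:
    target = Path(tmp) / "patient_conversation-positive.txt"
    sample = ["placeholder record 1", "placeholder record 2"]  # made-up data
    write_conversations(target, sample)
    write_conversations(target, sample)
    line_count = len(target.read_text(encoding="utf-8").splitlines())

print(line_count)  # 2, not 4: the second run replaced the first
```

If the cell has to write to the same file in several separate steps, an alternative is to keep append mode but delete or truncate the three files once at the top of the cell.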