I just started working on a small project that converts your WhatsApp conversations into an instruction dataset, which can then be used to fine-tune any open-source model to create a clone of yourself.
So far I have added redaction of sensitive info, and also of general WhatsApp messages, when parsing and compiling the dataset.
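For anyone curious what that parse, redact, and compile flow might look like, here is a rough sketch. It is not the project's actual code: the export line format (`12/31/23, 9:15 PM - Alice: Hello`), the redaction patterns, the skipped placeholder messages, and the names `redact`, `parse_chat`, and `build_pairs` are all assumptions made for illustration. Real exports vary by platform and locale.

```python
import re

# Assumed export line format (varies by platform and locale):
#   "12/31/23, 9:15 PM - Alice: Hello there"
STAMP_RE = re.compile(
    r"^\d{1,2}/\d{1,2}/\d{2,4}, \d{1,2}:\d{2}(?:\s?[AaPp][Mm])? - (.*)$"
)

# Illustrative redaction patterns; a real tool would need a broader set.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")

# Placeholder messages assumed to appear in exports; these are dropped.
SKIP_MESSAGES = {"<Media omitted>", "This message was deleted"}


def redact(text):
    """Replace emails and phone-like numbers with fixed tokens."""
    text = EMAIL_RE.sub("[EMAIL]", text)
    return PHONE_RE.sub("[PHONE]", text)


def parse_chat(raw):
    """Turn raw export text into a list of (sender, redacted_text) tuples."""
    messages = []
    for line in raw.splitlines():
        match = STAMP_RE.match(line)
        if match is None:
            # No timestamp: treat as a continuation of the previous message.
            if messages and line.strip():
                sender, text = messages[-1]
                messages[-1] = (sender, text + "\n" + redact(line))
            continue
        sender, sep, text = match.group(1).partition(": ")
        if not sep or text in SKIP_MESSAGES:
            # Timestamped line without "sender: text" is a system notice.
            continue
        messages.append((sender, redact(text)))
    return messages


def build_pairs(messages, me):
    """Pair each message from someone else with my direct reply to it."""
    pairs = []
    for (prev_sender, prev_text), (sender, text) in zip(messages, messages[1:]):
        if sender == me and prev_sender != me:
            pairs.append({"instruction": prev_text, "output": text})
    return pairs


sample = (
    "12/31/23, 9:15 PM - Messages and calls are end-to-end encrypted.\n"
    "12/31/23, 9:16 PM - Alice: What's your email?\n"
    "12/31/23, 9:17 PM - Bob: It's bob@example.com\n"
    "12/31/23, 9:18 PM - Alice: <Media omitted>\n"
)
dataset = build_pairs(parse_chat(sample), me="Bob")
print(dataset)
```

Pairing only adjacent messages keeps the sketch short. Merging consecutive messages from the same sender, or adding earlier turns as context, would likely give better training examples.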
It's still very early, but I'd appreciate any suggestions or discussion around this.