Technical
Given a large set of CSV files with thousands of paragraphs each, how would you detect duplicates within each file, and how would you scale this solution for many files?
Top community answer
No community answers yet.
Contribute an answerHave a better answer?
Share your experience and earn credits toward your next interview session.
Contribute an answer