Technical

Given a large set of CSV files with thousands of paragraphs each, how would you detect duplicates within each file, and how would you scale this solution for many files?

Top community answer

No community answers yet.
