Computing with Data

elgeish

639.3K views

GitHub

Open Source Your Knowledge, Become a Contributor

Technology knowledge has to be shared and made accessible for free. Join the movement.

Create Content

Previous: Working with Databases Next: Retry Policies

Partitioning

To illustrate how this partitioning scheme allows for a balanced cluster assignment, we used 4450 email addresses from the Enron dataset to simulate arbitrary email addresses (keys) and we calculated how they would be assigned across our 5 clusters using the Python script below:

Open Source Your Knowledge: become a Contributor and help others learn. Create New Content

Open Source Your Knowledge, Become a Contributor

292/300 Partitioning

Partitioning

PYTHON: BEGINNER QUIZ (10 Questions)

Simple Python Test

9a7ba

Python from Zero to Hero