Details
Description
The current implementation in the ML library is exact but can be a bit slower than it should with large amounts of data. This implementation would have the user specify a parameter for the degree of accuracy required. Local m tiles would be computed (m large enough to achieve the desired accuracy) in parallel, and the information merged to create the final approximate n-tile frequency counts.