public static class InputSampler.RandomSampler<K,V> extends Object implements InputSampler.Sampler<K,V>
构造器和说明 |
---|
InputSampler.RandomSampler(double freq,
int numSamples)
Create a new RandomSampler sampling all splits.
|
InputSampler.RandomSampler(double freq,
int numSamples,
int maxSplitsSampled)
Create a new RandomSampler.
|
限定符和类型 | 方法和说明 |
---|---|
K[] |
getSample(InputFormat<K,V> inf,
JobConf job)
Randomize the split order, then take the specified number of keys from
each split sampled, where each key is selected with the specified
probability and possibly replaced by a subsequently selected key when
the quota of keys from that split is satisfied.
|
public InputSampler.RandomSampler(double freq, int numSamples)
freq
- Probability with which a key will be chosen.numSamples
- Total number of samples to obtain from all selected
splits.public InputSampler.RandomSampler(double freq, int numSamples, int maxSplitsSampled)
freq
- Probability with which a key will be chosen.numSamples
- Total number of samples to obtain from all selected
splits.maxSplitsSampled
- The maximum number of splits to examine.public K[] getSample(InputFormat<K,V> inf, JobConf job) throws IOException
getSample
在接口中 InputSampler.Sampler<K,V>
IOException
Copyright © 2009 The Apache Software Foundation