List of usage examples for org.apache.hadoop.mapreduce RecordReader subclass-usage
From source file uk.bl.wa.hadoop.mapreduce.lib.input.ByteBlockRecordReader.java
public class ByteBlockRecordReader extends RecordReader<Path, BytesWritable> { private static final Log log = LogFactory.getLog(ByteBlockRecordReader.class); private FSDataInputStream fsdis; private Path path; private BytesWritable buf = new BytesWritable();
From source file uk.bl.wa.hadoop.mapreduce.WebArchiveRecordReader.java
/** * @author Andrew Jackson <Andrew.Jackson@bl.uk> * */ public class WebArchiveRecordReader extends RecordReader<Text, WritableArchiveRecord> {
From source file uk.org.taverna.platform.execution.impl.hadoop.TavernaRecordReader.java
/** * * * @author David Withers */ public class TavernaRecordReader extends RecordReader<LongWritable, MapWritable> {
From source file voldemort.hadoop.VoldemortRecordReader.java
public class VoldemortRecordReader extends RecordReader<ByteArray, Versioned<byte[]>> { private AdminClient adminClient; private Iterator<Pair<ByteArray, Versioned<byte[]>>> iter = null; private Pair<ByteArray, Versioned<byte[]>> currentPair = null;
From source file vtpassim.TarballReader.java
/**
* TarballReader.
*
* Outputs for file included in a tarball a key/value pair where the key is
* the file name appended with date and time (.DYYMMDD.THHMMSS) and the value
* is the content of the file.
From source file webindex.data.util.WARCFileRecordReader.java
/**
* The WARC File Record Reader processes a single compressed input. The Record Reader returns a
* single WARC ArchiveReader that can contain numerous individual documents, each document handled
* in a single mapper.
*
* @author Stephen Merity (Smerity)
From source file wikiParser.mapReduce.util.KeyValueLineRecordReader.java
/**
* This class treats a line in the input as a key/value pair separated by a
* separator character. The separator can be specified in config file
* under the attribute name mapreduce.input.keyvaluelinerecordreader.key.value.separator. The default
* separator is the tab character ('\t').
*/