List of usage examples for org.apache.hadoop.mapreduce.lib.input FileInputFormat subclass-usage
From source file fi.tkk.ics.hadoop.bam.VCFInputFormat.java
/** An {@link org.apache.hadoop.mapreduce.InputFormat} for VCF files. Values * are the individual records; see {@link VCFRecordReader} for the meaning of * the key. */ public class VCFInputFormat extends FileInputFormat<LongWritable, VariantContextWritable> { /** Whether file extensions are to be trusted, defaults to true.
From source file fire.util.fileformats.pdf.PdfInputFormat.java
public class PdfInputFormat extends FileInputFormat<Text, Text> { @Override public RecordReader<Text, Text> createRecordReader(InputSplit split, TaskAttemptContext context) throws IOException, InterruptedException {
From source file format.IndexableFileInputFormat.java
/** * Abstract class representing a <code>FileInputFormat</code> for * <code>Indexable</code> objects. */ public abstract class IndexableFileInputFormat<K, V extends Indexable> extends FileInputFormat<K, V> {
From source file format.OverlapInputFormat.java
/**
* An {@link org.apache.hadoop.mapreduce.InputFormat} for 4mc compressed files.
* This is the base class, mainly managing input splits, leveraging 4mc block index.
*
* Subclasses must only make sure to provide an implementation of createRecordReader.
* See {@link com.hadoop.mapreduce.FourMcTextInputFormat} as example reading text files.
From source file format.OverlapLengthInputFormat.java
/**
* FixedLengthInputFormat is an input format used to read input files
* which contain fixed length records. The content of a record need not be
* text. It can be arbitrary binary data. Users must configure the record
* length property by calling:
* OverlapLengthInputFormat.setRecordLength(conf, recordLength);<br><br> or
From source file fr.ens.biologie.genomique.eoulsan.bio.io.hadoop.FastqInputFormat.java
/**
* This class define an InputFormat for FASTQ files for the Hadoop MapReduce
* framework.
* @since 1.0
* @author Laurent Jourdren
*/
From source file framework.util.IndexableFileInputFormat.java
/** * Abstract class representing a <code>FileInputFormat</code> for * <code>Indexable</code> objects. */ public abstract class IndexableFileInputFormat<K, V extends Indexable> extends FileInputFormat<K, V> {
From source file genepi.hadoop.formats.NLineInputFormat.java
/**
* NLineInputFormat which splits N lines of input as one split.
*
* In many "pleasantly" parallel applications, each process/mapper processes the
* same input file (s), but with computations are controlled by different
* parameters.(Referred to as "parameter sweeps"). One way to achieve this, is
From source file gov.jgi.meta.hadoop.input.FastaBlockInputFormat.java
/** An {@link FastaBlockInputFormat} is for fasta text files. Files are broken
* records seperated by ">" eg:
* >756:1:1:1074:20235/1
* TGCAGCTCAACANCGTCGGCTACGACNNCACCNNNGAGCGCATCGGCTNCNNNANNNCCTNNNNNNNNCGGGAGGT
* >756:1:1:1074:20235/2
* TCGTCGCTGAAGCCTTCTTCCACCTTGGCGTTGAACGCCTCCATGTCCAGTGGAGTCCCCTGGACCCCGCGCCCGC
From source file gov.jgi.meta.hadoop.input.FastaInputFormat.java
/** An {@link FastaInputFormat} is for fasta text files. Files are broken
* records seperated by ">" eg:
* >756:1:1:1074:20235/1
* TGCAGCTCAACANCGTCGGCTACGACNNCACCNNNGAGCGCATCGGCTNCNNNANNNCCTNNNNNNNNCGGGAGGT
* >756:1:1:1074:20235/2
* TCGTCGCTGAAGCCTTCTTCCACCTTGGCGTTGAACGCCTCCATGTCCAGTGGAGTCCCCTGGACCCCGCGCCCGC