com.twitter.scalding.source

DailySuffixTypedTsv

class DailySuffixTypedTsv[T] extends DailySuffixSource with TypedDelimited[T]

Source
DailySources.scala
Linear Supertypes
  1. DailySuffixTypedTsv
  2. TypedDelimited
  3. TypedSink
  4. Mappable
  5. TypedSource
  6. DelimitedScheme
  7. DailySuffixSource
  8. TimePathedSource
  9. TimeSeqPathedSource
  10. FileSource
  11. HfsTapProvider
  12. LocalSourceOverride
  13. SchemedSource
  14. Source
  15. Serializable
  16. AnyRef
  17. Any

Instance Constructors

  1. new DailySuffixTypedTsv(prefix: String)(implicit dateRange: DateRange, mf: Manifest[T], conv: TupleConverter[T], tset: TupleSetter[T])
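
    A minimal usage sketch, assuming Scalding is on the classpath and that the daily data lives under a hypothetical prefix /logs/clicks. The constructor takes its DateRange, Manifest, TupleConverter and TupleSetter implicitly; here the DateRange is built by hand, and the tuple instances are assumed to come from Scalding's default implicits for tuples.

```scala
import java.util.TimeZone
import com.twitter.scalding._
import com.twitter.scalding.source.DailySuffixTypedTsv

object DailySuffixTypedTsvUsage {
  // Parsing a RichDate from a string needs a time zone and a date parser in implicit scope.
  implicit val tz: TimeZone = DateOps.UTC
  implicit val parser: DateParser = DateParser.default

  // Picked up implicitly by the constructor below.
  implicit val dateRange: DateRange =
    DateRange(RichDate("2015-01-01"), RichDate("2015-01-03"))

  // "/logs/clicks" is a hypothetical prefix; the type parameter describes one TSV row.
  val clicks = new DailySuffixTypedTsv[(String, Long)]("/logs/clicks")

  // The source can then feed the typed API.
  val pipe: TypedPipe[(String, Long)] = TypedPipe.from(clicks)
}
```

    Inside a Job, the DateRange is more commonly supplied by the job itself than declared by hand as it is here.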


Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. def addTypes(sel: Array[Comparable[_]]): Fields

    Definition Classes
    TypedDelimited
  5. def allPaths: Iterable[String]

    These are all the paths we will read for this data, completely enumerated.

    Definition Classes
    TimeSeqPathedSource
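
    To make "completely enumerated" concrete, here is a standalone sketch that expands a date range into one path per day. It does not use Scalding; the year/month/day layout with a trailing /* glob and the helper name dailyPaths are assumptions made for illustration, not the library's implementation.

```scala
import java.time.LocalDate
import java.time.format.DateTimeFormatter

object DailyPathsSketch {
  // Assumed directory layout: <prefix>/yyyy/MM/dd/*
  private val dayFormat = DateTimeFormatter.ofPattern("yyyy/MM/dd")

  // Enumerate one glob per day, with both endpoints included.
  def dailyPaths(prefix: String, start: LocalDate, end: LocalDate): List[String] =
    Iterator
      .iterate(start)(_.plusDays(1))
      .takeWhile(day => !day.isAfter(end))
      .map(day => s"$prefix/${day.format(dayFormat)}/*")
      .toList
}
```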
  6. def allPathsFor(pattern: String): Iterable[String]

    Attributes
    protected
    Definition Classes
    TimeSeqPathedSource
  7. def andThen[U](fn: (T) ⇒ U): typed.TypedSource[U]

    Transforms this TypedSource into another by applying fn to each element after it is read. It is not called map because that name would conflict with Mappable.

    Definition Classes
    TypedSource
  8. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  9. def checkFlowDefNotNull()(implicit flowDef: FlowDef, mode: Mode): Unit

    Attributes
    protected
    Definition Classes
    Source
  10. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  11. def contraMap[U](fn: (U) ⇒ T): typed.TypedSink[U]

    Transforms this sink into a sink of another type U by applying fn to each value before it is written.

    Definition Classes
    TypedSink
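
    The direction of contraMap can be easy to misread, so here is a standalone sketch of the idea. MiniSink, BufferSink and Click are hypothetical stand-ins written for this example; they are not Scalding types.

```scala
import scala.collection.mutable.ArrayBuffer

// A stripped-down stand-in for a typed sink (not Scalding's TypedSink).
trait MiniSink[T] { self =>
  def write(value: T): Unit

  // Accept values of type U by converting each one to T before writing it.
  def contraMap[U](fn: U => T): MiniSink[U] = new MiniSink[U] {
    def write(value: U): Unit = self.write(fn(value))
  }
}

object ContraMapSketch {
  // Collects rows in memory so the effect of the adapted sink can be inspected.
  final class BufferSink extends MiniSink[(String, Int)] {
    val rows: ArrayBuffer[(String, Int)] = ArrayBuffer.empty
    def write(value: (String, Int)): Unit = { rows += value; () }
  }

  final case class Click(user: String, count: Int)

  def demo(): List[(String, Int)] = {
    val tupleSink = new BufferSink
    // The function runs first, then the original sink receives the tuple.
    val clickSink: MiniSink[Click] = tupleSink.contraMap[Click](c => (c.user, c.count))
    clickSink.write(Click("alice", 3))
    clickSink.write(Click("bob", 1))
    tupleSink.rows.toList
  }
}
```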
  12. implicit val conv: TupleConverter[T]

    Definition Classes
    DailySuffixTypedTsv → TypedDelimited
  13. def converter[U >: T]: TupleConverter[U]

    Because TupleConverter cannot be covariant, the widening to a supertype has to be done explicitly here. A typical implementation takes (implicit conv: TupleConverter[T]) and then defines:

    override def converter[U >: T] = TupleConverter.asSuperConverter[T, U](conv)

    Definition Classes
    TypedDelimited → TypedSource
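
    The need for this explicit step can be shown with a standalone sketch. MiniConverter below is a hypothetical invariant type written for this example, with a helper that mirrors the shape of asSuperConverter; it is not Scalding's TupleConverter.

```scala
// Invariant in T, like the converter described above (a stand-in, not Scalding's type).
trait MiniConverter[T] {
  def convert(fields: Seq[String]): T
}

object MiniConverter {
  // Explicit widening: wrap a converter for T as a converter for any supertype U.
  def asSuperConverter[T, U >: T](conv: MiniConverter[T]): MiniConverter[U] =
    new MiniConverter[U] {
      def convert(fields: Seq[String]): U = conv.convert(fields)
    }
}

object ConverterSketch {
  val intConverter: MiniConverter[Int] = new MiniConverter[Int] {
    def convert(fields: Seq[String]): Int = fields.head.toInt
  }

  // val rejected: MiniConverter[Any] = intConverter  // does not compile: MiniConverter is invariant
  val widened: MiniConverter[Any] =
    MiniConverter.asSuperConverter[Int, Any](intConverter)
}
```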
  14. def createHdfsReadTap(hdfsMode: Hdfs): Tap[JobConf, _, _]

    Attributes
    protected
    Definition Classes
    FileSource
  15. def createHfsTap(scheme: Scheme[JobConf, RecordReader[_, _], OutputCollector[_, _], _, _], path: String, sinkMode: SinkMode): Hfs

    Definition Classes
    HfsTapProvider
  16. def createLocalTap(sinkMode: SinkMode): Tap[JobConf, _, _]

    Creates a local tap.

    sinkMode

    The mode for handling output conflicts.

    returns

    A tap.

    Definition Classes
    LocalSourceOverride
  17. def createTap(readOrWrite: AccessMode)(implicit mode: Mode): Tap[_, _, _]

    Subclasses of Source MUST override this method. They may call out to TestTapFactory to make Taps suitable for testing.

    Definition Classes
    FileSource → Source
  18. implicit val dateRange: DateRange

  19. def defaultDurationFor(pattern: String): Option[Duration]

    Override this if you have, for instance, an hourly pattern but want to run every 6 hours. By default, we call TimePathedSource.stepSize(pattern, tz).

    Attributes
    protected
    Definition Classes
    TimeSeqPathedSource
  20. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  21. def equals(that: Any): Boolean

    Definition Classes
    TimeSeqPathedSource → AnyRef → Any
  22. val fields: Fields

    Definition Classes
    TypedDelimited → DelimitedScheme
  23. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  24. final def flatMapTo[U](out: Fields)(mf: (T) ⇒ TraversableOnce[U])(implicit flowDef: FlowDef, mode: Mode, setter: TupleSetter[U]): Pipe

    If you want to filter, use this and output an Iterable of length 0 or 1. Filter does not change column names, and we generally expect to change columns here.

    Definition Classes
    Mappable
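
    The "0 or 1 length Iterable" idiom is the same one used on ordinary Scala collections. The sketch below uses a plain List rather than a Scalding source, and the rule it applies (keep positive counts, double them) is a made-up example.

```scala
object FilterViaFlatMap {
  // Drop rows with a non-positive count and reshape the survivors in a single pass:
  // None plays the role of the empty result, Some(...) of the one-element result.
  def positiveDoubled(rows: List[(String, Int)]): List[(String, Int)] =
    rows.flatMap { case (user, count) =>
      if (count > 0) Some((user, count * 2)) else None
    }
}
```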
  25. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  26. def getPathStatuses(conf: Configuration): Iterable[(String, Boolean)]

    Get path statuses based on the date range. Each path is tested with pathIsGood, which by default checks that there is at least one file in that directory.

    Definition Classes
    TimeSeqPathedSource
  27. def goodHdfsPaths(hdfsMode: Hdfs): Iterable[String]

    Attributes
    protected
    Definition Classes
    FileSource
  28. def hashCode(): Int

    Definition Classes
    TimeSeqPathedSource → AnyRef → Any
  29. def hdfsPaths: Iterable[String]

    Definition Classes
    TimeSeqPathedSource → FileSource
  30. def hdfsReadPathsAreGood(conf: Configuration): Boolean

    Definition Classes
    TimeSeqPathedSource → FileSource
  31. def hdfsScheme: Scheme[JobConf, RecordReader[_, _], OutputCollector[_, _], _, _]

    The scheme to use if the source is on HDFS.

    Definition Classes
    DelimitedScheme → SchemedSource
  32. def hdfsWritePath: String

    Definition Classes
    TimePathedSource → FileSource
  33. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  34. def localPaths: Iterable[String]

    A path to use for the local tap.

    Definition Classes
    TimePathedSource → LocalSourceOverride
  35. def localScheme: TextDelimited

    The scheme to use if the source is local.

    Definition Classes
    DelimitedScheme → SchemedSource
  36. def localWritePath: String

    Definition Classes
    LocalSourceOverride
  37. final def mapTo[U](out: Fields)(mf: (T) ⇒ U)(implicit flowDef: FlowDef, mode: Mode, setter: TupleSetter[U]): Pipe

    Definition Classes
    Mappable
  38. implicit val mf: Manifest[T]

    Definition Classes
    DailySuffixTypedTsv → TypedDelimited
  39. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  40. final def notify(): Unit

    Definition Classes
    AnyRef
  41. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  42. def pathIsGood(globPattern: String, conf: Configuration): Boolean

    Determines if a path is 'valid' for this source. In strict mode all paths must be valid. In non-strict mode, all invalid paths will be filtered out.

    Subclasses can override this to validate paths.

    The default implementation is a quick sanity check to look for missing or empty directories. It is necessary but not sufficient -- there are cases where this will return true but there is in fact missing data.

    TODO: consider writing a more in-depth version of this method in TimePathedSource that looks for missing days, hours, etc.

    Attributes
    protected
    Definition Classes
    FileSource
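
    The strict and non-strict behaviours described above can be modelled in a few lines of plain Scala. This is a sketch only: selectPaths is a hypothetical helper, and reporting a strict-mode failure as a Left is an assumption of the example, since this page does not say how the library signals it.

```scala
object PathValidationSketch {
  // isGood stands in for pathIsGood; it decides whether a single path is usable.
  def selectPaths(paths: Seq[String], strict: Boolean)(
      isGood: String => Boolean): Either[String, Seq[String]] = {
    val (good, bad) = paths.partition(isGood)
    val listed = bad.mkString(", ")
    if (strict && bad.nonEmpty) Left("invalid paths: " + listed) // strict: every path must be valid
    else Right(good)                                             // non-strict: invalid paths are dropped
  }
}
```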
  43. val pattern: String

    Definition Classes
    TimePathedSource
  44. val patterns: Seq[String]

    Definition Classes
    TimeSeqPathedSource
  45. val quote: String

    Definition Classes
    DelimitedScheme
  46. def read(implicit flowDef: FlowDef, mode: Mode): Pipe

    Definition Classes
    Source
  47. val safe: Boolean

    Definition Classes
    DelimitedScheme
  48. val separator: String

    Definition Classes
    TypedDelimited → DelimitedScheme
  49. def setter[U <: T]: TupleSetter[U]

    Definition Classes
    TypedDelimited → TypedSink
  50. final def sinkFields: Fields

    Definition Classes
    TypedDelimited → TypedSink
  51. val sinkMode: SinkMode

    Definition Classes
    SchemedSource
  52. val skipHeader: Boolean

    Definition Classes
    TypedDelimited → DelimitedScheme
  53. def sourceFields: Fields

    Definition Classes
    TypedSource
  54. def sourceId: String

    This is a name that refers to this exact instance of the source. Put another way, if s1.sourceId == s2.sourceId, the job should work the same if one is replaced with the other.

    Definition Classes
    Source
  55. val strict: Boolean

    Definition Classes
    DelimitedScheme
  56. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  57. def toIterator(implicit config: Config, mode: Mode): Iterator[T]

    Allows you to read a Tap on the submit node. NOT FOR USE IN THE MAPPERS OR REDUCERS. A typical use might be to read in Job.next to determine whether another job is needed.

    Definition Classes
    Mappable
  58. def toString(): String

    Definition Classes
    TimeSeqPathedSource → AnyRef → Any
  59. def transformForRead(pipe: Pipe): Pipe

    Attributes
    protected
    Definition Classes
    Source
  60. def transformForWrite(pipe: Pipe): Pipe

    Attributes
    protected
    Definition Classes
    Source
  61. def transformInTest: Boolean

    The mock passed in to scalding.JobTest may be considered as a mock of the Tap or the Source. By default, as of 0.9.0, it is considered a mock of the Source. If you set this to true, the mock in TestMode will be considered a mock of the Tap (which must be transformed) and not the Source.

    Definition Classes
    Source
  62. implicit val tset: TupleSetter[T]

    Definition Classes
    DailySuffixTypedTsv → TypedDelimited
  63. val types: Array[Class[_]]

    Definition Classes
    TypedDelimited → DelimitedScheme
  64. val tz: TimeZone

    Definition Classes
    TimeSeqPathedSource
  65. def validateTaps(mode: Mode): Unit

    Definition Classes
    FileSource → Source
  66. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  67. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  68. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  69. def writeFrom(pipe: Pipe)(implicit flowDef: FlowDef, mode: Mode): Pipe

    Writes the pipe but returns the input so it can be chained into the next operation.

    Definition Classes
    Source
  70. val writeHeader: Boolean

    Definition Classes
    TypedDelimited → DelimitedScheme

Deprecated Value Members

  1. def readAtSubmitter[T](implicit mode: Mode, conv: TupleConverter[T]): Stream[T]

    Definition Classes
    Source
    Annotations
    @deprecated
    Deprecated

    (Since version 0.9.0) replace with Mappable.toIterator
