apache spark - Where is the reference for options for writing or reading per format? -


i use spark 1.6.1.

we trying write orc file hdfs using hivecontext , dataframewriter. while can use

df.write().orc(<path>) 

we rather like

df.write().options(map("format" -> "orc", "path" -> "/some_path") 

this have flexibility change format or root path depending on application uses helper library. can find reference options can passed dataframewriter? found nothing in docs here

https://spark.apache.org/docs/1.6.0/api/java/org/apache/spark/sql/dataframewriter.html#options(java.util.map)

where can find reference options can passed dataframewriter?

the definitive , authoritative answer sources:

some description may find in docs, there no single page (that possibly auto-generated sources stay up-to-date most).

the reason being options separated format implementation on purpose have flexibility want offer per use case (as duly noted):

this have flexibility change format or root path depending on application uses helper library.


your question seems similar how know file formats supported databricks? said:

where can list of options supported each file format?

that's not possible there no api follow (like in spark mllib) define options. every format on own...unfortunately , best bet read documentation or (more authoritative) source code.


Comments

Popular posts from this blog

Is there a better way to structure post methods in Class Based Views -

performance - Why is XCHG reg, reg a 3 micro-op instruction on modern Intel architectures? -

jquery - Responsive Navbar with Sub Navbar -