DseGraphFrame

Instance Constructors

new DseGraphFrame(gf: GraphFrame, _graphName: Option[String] = None, _graphSchema: Option[SerializableSchema] = None)

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def E(): DseGraphTraversal[Edge]

Returned graph traversal supports subset of TinkerPop3 traversal steps
Returned graph traversal supports subset of TinkerPop3 traversal steps
returns
GraphTraversal[Edge] for the graph
def V(): DseGraphTraversal[Vertex]

Returned graph traversal supports subset of TinkerPop3 traversal steps
Returned graph traversal supports subset of TinkerPop3 traversal steps
returns
GraphTraversal[Vertex] for the graph
var _graphName: Option[String]
var _graphSchema: Option[SerializableSchema]

Attributes
protected
final def asInstanceOf[T0]: T0

Definition Classes
Any
def cache(): DseGraphFrame.this.type

proxy call to gf.cache()
proxy call to gf.cache()
returns
this
def cleanUp: String

Remove any invalid vertex property and edge entries from the database backend.
Remove any invalid vertex property and edge entries from the database backend. Call this method if you get internal errors or inconsistent results from any graph queries it is strongly recommended to run nodetool repair graphName before and then again after this call the call revises graph database storage and fixes following problems - delete vertex properties entries of non-existent vertex
- delete vertex properties with unknown ids
- delete edges with unknown/removed edge or vertex labels
- delete edges that points to non-existent vertices
- restore second copy of the edge
- delete second copy of the edge if primary record exist
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
def deleteEdgeProperties(df: DataFrame, properties: String*): Unit

clean edges properties
clean edges properties
properties
delete only selected properties not entire row

Annotations
@varargs()
def deleteEdges(df: DataFrame, cache: Boolean = true): Unit

delete graph edges.
delete graph edges. 4 id columns should be passed to the method +--------------------+--------------------+-------+--------------------+ | src| dst| ~label| id| +--------------------+--------------------+-------+--------------------+ |god:THxdAAAAAAAAAAAA|titan:J474AAAAAAA...| father|da0a9900-8fe1-11e...| +--------------------+--------------------+-------+--------------------+
df
data frame with edge ids: src,dst,~label, id
cache
cache df before processing, true by default for consistence updates. two C* entries need to be deleted for one edge, so no reloads expected between this two calls.
def deleteEdges(df: DataFrame): Unit

shortcut for deleteEdges(df: DataFrame, cache: Boolean = true) for Java
def deleteVertexProperties(df: DataFrame, properties: Seq[String], labels: Seq[String] = Seq.empty, cache: Boolean = true): Unit

clean vertex properties with meta properties
clean vertex properties with meta properties
properties
property names to delete
def deleteVertexProperties(df: DataFrame, properties: String*): Unit

clean vertex properties with meta properties
clean vertex properties with meta properties
properties
property names to delete

Annotations
@varargs()
def deleteVertices(label: String): Unit

delete all vertices with given label
def deleteVertices(df: DataFrame, labels: Seq[String] = Seq.empty, cache: Boolean = true): Unit

delete vertices and all related edges
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
val gf: GraphFrame
def graphName: String
def graphName_=(name: String): Unit

restore or change the name of the graph
def graphSchema: SerializableSchema

Return schema of this graph base on it name NoSuchElementException will be thrown if graph name is unknown and schema can not be retrieved
Return schema of this graph base on it name NoSuchElementException will be thrown if graph name is unknown and schema can not be retrieved
returns
Graph Schema
def hashCode(): Int

Definition Classes
AnyRef → Any
def idColumn(labelColumn: Column, idColumns: Column*): Column

Utility method to generate GraphFrame compatible ids, if a mixed set of labels is in the DF.
Utility method to generate GraphFrame compatible ids, if a mixed set of labels is in the DF. It is slower then String, idColumns: Column*): Column The id is added automatically when vertex is inserted, if inserted columns has the same names as in graph schema It is not possible for edges as you need to point both src and dst ids. Usage: val updateEdgeDF = sourceDF.select (gf.idColumn(col("srcLabel"), col("srcId")) as src, gf.idColumn(col("dstLabel"), col("dstId")) ad dst, col("label") as "~label", gf.randomEdgeIdColumn, col("property")) gf.updateEdges(updateEdgeDF) If different labels have different id format use case statement to sort them: when(col("srcLabel") === "1format", col("src1Id")).when(col("srcLabel") === "2format", col("src2Id")).otherwise(col("src3Id")) as src

Annotations
@varargs()
def idColumn(label: String, idColumns: Column*): Column

Utility method to generate GraphFrame compatible ids.
Utility method to generate GraphFrame compatible ids. The id is added automatically when vertex is inserted, if inserted columns has the same names as in graph schema It is not possible for edges as you need to point both src and dst ids. Usage: val updateEdgeDF = sourceDF.select (gf.idColumn("srcLabel", col("srcId")) ad src, gf.idColumn("dstLabel", col("dstId")) as dst, col("label") as "~label", gf.randomEdgeIdColumn, col("property")) gf.updateEdges(updateEdgeDF)

Annotations
@varargs()
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
def nativeJavaTypeConverter(columnName: String): TypeConverter[_]
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
def persist(storageLevel: StorageLevel): DseGraphFrame.this.type

proxy call to gf.persist()
proxy call to gf.persist()
returns
this
def persist(): DseGraphFrame.this.type

proxy call to gf.persist()
proxy call to gf.persist()
returns
this
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def toExternalEdgeIdAsMap(id: AnyRef): Map[String, AnyRef]
def toExternalVertexIdAsMap(id: AnyRef): Map[String, AnyRef]
def toString(): String

Definition Classes
AnyRef → Any
def toSyntheticVertexId(id: AnyRef): String
def unpersist(blocking: Boolean): DseGraphFrame.this.type

proxy call to gf.unpersist()
proxy call to gf.unpersist()
returns
this
def unpersist(): DseGraphFrame.this.type

proxy call to gf.unpersist()
proxy call to gf.unpersist()
returns
this
def updateEdges(outVertexLabel: String, edgeLabel: String, inVertexLabel: String, df: DataFrame): Unit

update this graph edges.
update this graph edges. this method accept natural vertex id columns. Out vertex column names should start with "out_" prefix and in names with "in_". The method will update only one triplet combination. the minimal df schema is: 2 id columns and 0 or more properties columns +-----+------+--------------------+-------------------+ |out_id|in_id| id| prop| +-----+------+--------------------+-------------------+ | 10| a|da0a9900-8fe1-11e...| value| +-----+------+--------------------+-------------------+
id column should contains UUID(0,0).toString() value for single edges and pre-generated UUID for mutli-cardinality edges outVertexLabel->edgeLabel->inVertexLabel is passed as parameters. the df is not cached by the function. the dataframe should be persisted by the user if dynamic data source is used.
df
data frame with edge ids and update columns
def updateEdges(df: DataFrame, cache: Boolean = true): Unit

update this graph edges.
update this graph edges. the minimal df schema is: 4 id columns and at least one property to update +--------------------+--------------------+-------+--------------------+-------------------+ | src| dst| ~label| id| prop| +--------------------+--------------------+-------+--------------------+-------------------+ |god:THxdAAAAAAAAAAAA|titan:J474AAAAAAA...| father|da0a9900-8fe1-11e...| value| +--------------------+--------------------+-------+--------------------+-------------------+
if ID column is not present it will be generated and edges will be saved as new.
df
data frame with edge ids and update columns
cache
cache df before processing, true by default for consistence updates. two C* entries need to be updated for one edge, so no reloads expected between this two calls.
def updateEdges(df: DataFrame): Unit

shortcut for updateEdges(df: DataFrame, cache: Boolean = true) for Java
def updateVertices(vertexLabel: String, df: DataFrame): Unit

update this graph vertices with properties provided in the df.
update this graph vertices with properties provided in the df. you should provide id in non encoded format +-----------------+---------+---------+ | community_id|member_id| age| +-----------------+---------+---------+ | 1182054400| 0| 0| +-----------------+---------+---------+ the df is not cached by the function.
vertexLabel
to update
df
dataframe with vertex id and update columns
def updateVertices(df: DataFrame, labels: Seq[String] = Seq.empty, cache: Boolean = true): Unit

update this graph vertices with properties provided in the df.
update this graph vertices with properties provided in the df. the minimal df schema is just vertex "id" and one property to update: +-----------------+---------+ | id| age| +-----------------+---------+ |god:AAAAATMAAA...| 0| +-----------------+---------+ label and vertices id will be extracted from the graph frame id. for better performance it is recommended to add/leave "~label" column +-----------------+---------+---------+ | id| ~label| age| +-----------------+---------+---------+ |god:AAAAATMAAA...| god| 0| +-----------------+---------+---------+ you can also provide id in non encoded format +-----------------+---------+---------+---------+ | community_id|member_id| ~label| age| +-----------------+---------+---------+---------+ | 1182054400| 0| god| 0| +-----------------+---------+---------+---------+ Note: passing both synthetic "id" and vertex Id columns is an error.
df
dataframe with vertex id and update columns
labels
empty (means all) by default, it is convenient to group vertexes with the same id format. That group could be passed here, to reduce number of verification steps
cache
cache df before processing, true by default for consistence update and performance
def updateVertices(df: DataFrame): Unit

shortcut for updateVertices(df: DataFrame, labels: Seq[String] = Seq.empty, cache: Boolean = true) for Java API
shortcut for updateVertices(df: DataFrame, labels: Seq[String] = Seq.empty, cache: Boolean = true) for Java API
df
dataframe with vertex id and update columns
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )

Related Docs: object DseGraphFrame | package graphframe

class DseGraphFrame extends Serializable

Instance Constructors

new DseGraphFrame(gf: GraphFrame, _graphName: Option[String] = None, _graphSchema: Option[SerializableSchema] = None)

Value Members

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: Any): Boolean

def E(): DseGraphTraversal[Edge]

def V(): DseGraphTraversal[Vertex]

var _graphName: Option[String]

var _graphSchema: Option[SerializableSchema]

final def asInstanceOf[T0]: T0

def cache(): DseGraphFrame.this.type

def cleanUp: String

def clone(): AnyRef

def deleteEdgeProperties(df: DataFrame, properties: String*): Unit

def deleteEdges(df: DataFrame, cache: Boolean = true): Unit

def deleteEdges(df: DataFrame): Unit

def deleteVertexProperties(df: DataFrame, properties: Seq[String], labels: Seq[String] = Seq.empty, cache: Boolean = true): Unit

def deleteVertexProperties(df: DataFrame, properties: String*): Unit

def deleteVertices(label: String): Unit

def deleteVertices(df: DataFrame, labels: Seq[String] = Seq.empty, cache: Boolean = true): Unit

final def eq(arg0: AnyRef): Boolean

def equals(arg0: Any): Boolean

def finalize(): Unit

final def getClass(): Class[_]

val gf: GraphFrame

def graphName: String

def graphName_=(name: String): Unit

def graphSchema: SerializableSchema

def hashCode(): Int

def idColumn(labelColumn: Column, idColumns: Column*): Column

def idColumn(label: String, idColumns: Column*): Column

final def isInstanceOf[T0]: Boolean

def nativeJavaTypeConverter(columnName: String): TypeConverter[_]

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

def persist(storageLevel: StorageLevel): DseGraphFrame.this.type

def persist(): DseGraphFrame.this.type

final def synchronized[T0](arg0: ⇒ T0): T0

def toExternalEdgeIdAsMap(id: AnyRef): Map[String, AnyRef]

def toExternalVertexIdAsMap(id: AnyRef): Map[String, AnyRef]

def toString(): String

def toSyntheticVertexId(id: AnyRef): String

def unpersist(blocking: Boolean): DseGraphFrame.this.type

def unpersist(): DseGraphFrame.this.type

def updateEdges(outVertexLabel: String, edgeLabel: String, inVertexLabel: String, df: DataFrame): Unit

def updateEdges(df: DataFrame, cache: Boolean = true): Unit

def updateEdges(df: DataFrame): Unit

def updateVertices(vertexLabel: String, df: DataFrame): Unit

def updateVertices(df: DataFrame, labels: Seq[String] = Seq.empty, cache: Boolean = true): Unit

def updateVertices(df: DataFrame): Unit

final def wait(): Unit

final def wait(arg0: Long, arg1: Int): Unit

final def wait(arg0: Long): Unit

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped