The is the officila graphframes-py PyPI package, which is a Python wrapper for the Scala GraphFrames library. This package is maintained by the GraphFrames project and is available on PyPI.
- Installation
- Creating Graphs
- Basic Graph Manipulations
- Centrality Metrics
- Motif finding
- Traversals and Connectivity
- Community Detection
- Scala API
- Python API
- Apache Spark compatibility
NOTE! Python distribution does not include JVM-core. You need to add it to your cluster or Spark-Connect server!
NOTE! GraphFrames depends on its own version of GraphX. While in case of installation from the Maven Central repository, all the runtime dependencies will be resolved automatically, you may need to add them manually in case of installation from the local repository.
GraphFrames PySpark is choosing connect or classic implementation implicitly based on the result of is_remote().
To enforce usage of connect-based implementation, you may export this variable SPARK_CONNECT_MODE_ENABLED=1
