graphstorm.dataloading.dataset

GraphStorm dataset provides one unified dataset class, i.e., GSgnnData, for all graph machine learning tasks. Users can build a GSgnnData object by giving the path of the JSON file created by the GraphStorm Graph Construction operations. The GSgnnData will load the related graph artifacts specified in the JSON file. It provides a set of APIs for users to extract information of the graph data for model training and inference.

GSgnnData

The GraphStorm data class.

graphstorm.dataloading.dataloading

GraphStorm dataloading module includes a set of different DataLoaders for different graph machine learning tasks.

If users would like to customize DataLoaders, please extend those dataloader base classes in the Base DataLoaders section and customize their abstract functions.

Base DataLoaders

GSgnnNodeDataLoaderBase

The base dataloader class for node tasks.

GSgnnEdgeDataLoaderBase

The base dataloader class for edge tasks.

GSgnnLinkPredictionDataLoaderBase

The base dataloader class for link prediction tasks.

DataLoaders

GSgnnNodeDataLoader

Mini-batch dataloader for node tasks.

GSgnnNodeSemiSupDataLoader

Semi-supervised mini-batch dataloader for node tasks.

GSgnnEdgeDataLoader

The mini-batch dataloader for edge prediction tasks.

GSgnnLinkPredictionDataLoader

Mini-batch dataloader for link prediction.

GSgnnLinkPredictionTestDataLoader

Mini-batch dataloader for link prediction validation and test.

GSgnnLinkPredictionPredefinedTestDataLoader

Mini-batch dataloader for link prediction validation and test with predefined negatives.