cuery.actors.utils#

Attributes#

Functions#

fetch_apify_dataset(source, id[, force_cloud])

Fetch a dataset from Apify and return it as a DataFrame.

fetch_parquet_dataset(url[, columns])

Fetch a Parquet dataset from a URL and return it as a DataFrame.

run_flex_tool(Actor, Tool, **kwargs)

Run a flex tool with the given arguments.

Module Contents#

cuery.actors.utils.ActorClass#
cuery.actors.utils.FlexToolClass#
cuery.actors.utils.DEFAULT_FLEX_CONFIG#
async cuery.actors.utils.fetch_apify_dataset(source, id, force_cloud=True)#

Fetch a dataset from Apify and return it as a DataFrame.

Parameters:
  • source (ActorClass | apify_client.ApifyClientAsync)

  • id (str)

  • force_cloud (bool)

Return type:

pandas.DataFrame

cuery.actors.utils.fetch_parquet_dataset(url, columns=None)#

Fetch a Parquet dataset from a URL and return it as a DataFrame.

Since we can’t validate column names before fetching, we try to read the dataset with and without specifying them.

Parameters:
  • url (str)

  • columns (list[str] | None)

Return type:

pandas.DataFrame

async cuery.actors.utils.run_flex_tool(Actor, Tool, **kwargs)#

Run a flex tool with the given arguments.

Parameters:
  • Actor (ActorClass)

  • Tool (FlexToolClass)