:py:mod:`astronomer.providers.google.cloud.triggers.dataproc` ============================================================= .. py:module:: astronomer.providers.google.cloud.triggers.dataproc Module Contents --------------- Classes ~~~~~~~ .. autoapisummary:: astronomer.providers.google.cloud.triggers.dataproc.DataprocCreateClusterTrigger astronomer.providers.google.cloud.triggers.dataproc.DataprocDeleteClusterTrigger astronomer.providers.google.cloud.triggers.dataproc.DataProcSubmitTrigger .. py:class:: DataprocCreateClusterTrigger(*, project_id = None, region = None, cluster_name, end_time, metadata = (), delete_on_error = True, cluster_config = None, labels = None, gcp_conn_id = 'google_cloud_default', polling_interval = 5.0, **kwargs) Bases: :py:obj:`airflow.triggers.base.BaseTrigger` Asynchronously check the status of a cluster :param project_id: The ID of the Google Cloud project the cluster belongs to :param region: The Cloud Dataproc region in which to handle the request :param cluster_name: The name of the cluster :param end_time: Time in second left to check the cluster status :param metadata: Additional metadata that is provided to the method :param gcp_conn_id: The connection ID to use when fetching connection info. :param polling_interval: Time in seconds to sleep between checks of cluster status .. py:method:: serialize() Serializes DataprocCreateClusterTrigger arguments and classpath. .. py:method:: run() :async: Check the status of cluster until reach the terminal state .. py:class:: DataprocDeleteClusterTrigger(cluster_name, end_time, project_id = None, region = None, metadata = (), gcp_conn_id = 'google_cloud_default', polling_interval = 5.0, **kwargs) Bases: :py:obj:`airflow.triggers.base.BaseTrigger` Asynchronously check the status of a cluster :param cluster_name: The name of the cluster :param end_time: Time in second left to check the cluster status :param project_id: The ID of the Google Cloud project the cluster belongs to :param region: The Cloud Dataproc region in which to handle the request :param metadata: Additional metadata that is provided to the method :param gcp_conn_id: The connection ID to use when fetching connection info. :param polling_interval: Time in seconds to sleep between checks of cluster status .. py:method:: serialize() Serializes DataprocDeleteClusterTrigger arguments and classpath. .. py:method:: run() :async: Wait until cluster is deleted completely .. py:class:: DataProcSubmitTrigger(*, dataproc_job_id, region = None, project_id = None, gcp_conn_id = 'google_cloud_default', polling_interval = 5.0) Bases: :py:obj:`airflow.triggers.base.BaseTrigger` Check for the state of a previously submitted Dataproc job. :param dataproc_job_id: The Dataproc job ID to poll. (templated) :param region: Required. The Cloud Dataproc region in which to handle the request. (templated) :param project_id: The ID of the google cloud project in which to create the cluster. (templated) :param location: (To be deprecated). The Cloud Dataproc region in which to handle the request. (templated) :param gcp_conn_id: The connection ID to use connecting to Google Cloud Platform. :param wait_timeout: How many seconds wait for job to be ready. .. py:method:: serialize() Serializes DataProcSubmitTrigger arguments and classpath. .. py:method:: run() :async: Simple loop until the job running on Google Cloud DataProc is completed or not