aws glue create-crawler

Creates a new crawler with specified targets, role, configuration, and optional schedule. At least one crawl target must be specified, in the s3Targets field, the jdbcTargets field, or the DynamoDBTargets field

Options

NameDescription
--name <string>Name of the new crawler
--role <string>The IAM role or Amazon Resource Name (ARN) of an IAM role used by the new crawler to access customer resources
--database-name <string>The AWS Glue database where results are written, such as: arn:aws:daylight:us-east-1::database/sometable/*
--description <string>A description of the new crawler
--targets <structure>A list of collection of targets to crawl
--schedule <string>A cron expression used to specify the schedule (see Time-Based Schedules for Jobs and Crawlers. For example, to run something every day at 12:15 UTC, you would specify: cron(15 12 * * ? *)
--classifiers <list>A list of custom classifiers that the user has registered. By default, all built-in classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification
--table-prefix <string>The table prefix used for catalog tables that are created
--schema-change-policy <structure>The policy for the crawler's update and deletion behavior
--recrawl-policy <structure>A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run
--lineage-configuration <structure>Specifies data lineage configuration settings for the crawler
--configuration <string>Crawler configuration information. This versioned JSON string allows users to specify aspects of a crawler's behavior. For more information, see Configuring a Crawler
--crawler-security-configuration <string>The name of the SecurityConfiguration structure to be used by this crawler
--tags <map>The tags to use with this crawler request. You may use tags to limit access to the crawler. For more information about tags in AWS Glue, see AWS Tags in AWS Glue in the developer guide
--cli-input-json <string>Performs service operation based on the JSON string provided. The JSON string follows the format provided by ``--generate-cli-skeleton``. If other arguments are provided on the command line, the CLI values will override the JSON-provided values. It is not possible to pass arbitrary binary values using a JSON-provided value as the string will be taken literally
--generate-cli-skeleton <string>Prints a JSON skeleton to standard output without sending an API request. If provided with no value or the value ``input``, prints a sample input JSON that can be used as an argument for ``--cli-input-json``. If provided with the value ``output``, it validates the command inputs and returns a sample output JSON for that command