Item Pipelines#
- class kingfisher_scrapy.pipelines.Validate[source]#
Drops duplicate files based on
file_name
and file items based onfile_name
andnumber
.- Raises:
jsonschema.ValidationError – if the item is invalid
- class kingfisher_scrapy.pipelines.Sample[source]#
Drops items and closes the spider once the sample size is reached.