[modular] add auto_docstring & more doc related refactors #12958

yiyixuxu · 2026-01-10T02:36:39Z

This PR adds a utility script utils/modular_auto_docstring.py that automatically generates docstrings for modular pipeline block classes from their doc property.

Usage

Mark classes with a # auto_docstring comment:

# auto_docstring
class QwenImageAutoVaeEncoderStep(AutoPipelineBlocks):
    block_classes = [QwenImageInpaintVaeEncoderStep, QwenImageImg2ImgVaeEncoderStep]
    block_names = ["inpaint", "img2img"]
    block_trigger_inputs = ["mask_image", "image"]

    @property
    def doc(self):
        return (
            "Vae encoder step that encodes image inputs into latent representations.\n"
            "This is an auto pipeline block.\n"
            " - `QwenImageInpaintVaeEncoderStep` (inpaint) is used when `mask_image` is provided.\n"
            " - `QwenImageImg2ImgVaeEncoderStep` (img2img) is used when `image` is provided."
        )

Run the script to insert docstrings:

python utils/modular_auto_docstring.py --fix_and_overwrite
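
For illustration only, here is a minimal sketch of how a script could locate classes carrying the marker. It is not the actual logic of utils/modular_auto_docstring.py. The regex, the `find_marked_classes` helper, and the assumption that the comment sits directly above the class statement are all hypothetical.

```python
import re

# Hypothetical pattern: a "# auto_docstring" comment line directly above a class statement.
MARKED_CLASS = re.compile(
    r"^# auto_docstring[ \t]*\n[ \t]*class[ \t]+(\w+)", re.MULTILINE
)


def find_marked_classes(source: str) -> list[str]:
    """Return the names of classes preceded by the marker comment."""
    return MARKED_CLASS.findall(source)


example = '''
# auto_docstring
class QwenImageAutoVaeEncoderStep(AutoPipelineBlocks):
    pass


class Unmarked:
    pass
'''

marked = find_marked_classes(example)
```

Only the marked class is collected. A real tool would additionally need to instantiate the block, read its doc property, and rewrite the file.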

HuggingFaceDocBuilderDev · 2026-01-10T02:44:15Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yiyixuxu · 2026-01-10T11:31:37Z

src/diffusers/modular_pipelines/modular_pipeline_utils.py

+    # ======================================================
+
+    @classmethod
+    def prompt(cls) -> "InputParam":


Our pipeline parameters are pretty consistent across different pipelines, e.g. you always have prompt, height, width, num_inference_steps, etc. I made templates for these common ones so that they are easier to define.

Before, you needed:

InputParam(
    name="prompt",
    type_hint=str,
    required=True,
    description="The prompt or prompts to guide image generation.",
)

Now you can write:

InputParam.prompt()
InputParam.height(default=1024)
InputParam.num_inference_steps(default=28)
InputParam.generator()
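
A rough sketch of what such classmethod templates could look like, assuming a simplified stand-in dataclass rather than the real InputParam in modular_pipeline_utils.py. The field set and the `height` description are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class InputParam:
    # Simplified stand-in for the real class; the field set is an assumption.
    name: Optional[str] = None
    type_hint: Any = None
    default: Any = None
    required: bool = False
    description: str = ""

    @classmethod
    def prompt(cls) -> "InputParam":
        # Template for the common "prompt" input.
        return cls(
            name="prompt",
            type_hint=str,
            required=True,
            description="The prompt or prompts to guide image generation.",
        )

    @classmethod
    def height(cls, default: Optional[int] = None) -> "InputParam":
        # Template for "height"; callers only supply what differs per pipeline.
        return cls(
            name="height",
            type_hint=int,
            default=default,
            description="The height in pixels of the generated image.",
        )


inputs = [InputParam.prompt(), InputParam.height(default=1024)]
```

Each template fixes the name, type hint, and description in one place, and the call site only passes per-pipeline values such as the default.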

I'm a bit apprehensive about introducing dedicated class methods for common parameters in this way. I think the class can become quite large as common inputs expand.

I would prefer to keep the current syntax (IMO this ensures InputParams are defined in a consistent way) and use __post_init__ on the dataclass to automatically add a description, e.g.:

# centralised descriptions would live somewhere like constants.py
# can be used for modular + non-modular
INPUT_PARAM_TEMPLATES = {
    "prompt": {"type_hint": str, "required": True, "description": "The prompt or prompts to guide image generation."},
    "height": {"type_hint": int, "description": "The height in pixels of the generated image."},
    "width": {"type_hint": int, "description": "The width in pixels of the generated image."},
    "generator": {"type_hint": torch.Generator, "description": "Torch generator for deterministic generation."},
    # ...
}


@dataclass
class InputParam:
    name: str = None
    type_hint: Any = None
    required: bool = False
    default: Any = None
    description: str = None

    def __post_init__(self):
        if not self.name or self.name not in INPUT_PARAM_TEMPLATES:
            return
        template = INPUT_PARAM_TEMPLATES[self.name]
        if self.type_hint is None:
            self.type_hint = template.get("type_hint")
        if self.description is None:
            self.description = template.get("description")
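
To make the behaviour concrete, here is a self-contained, runnable variant of the same idea. It is a sketch under stated assumptions: the registry is trimmed to two entries and the torch.Generator entry is left out so it runs without torch.

```python
from dataclasses import dataclass
from typing import Any, Optional

# Hypothetical central registry, trimmed for the example (no torch dependency).
INPUT_PARAM_TEMPLATES = {
    "prompt": {
        "type_hint": str,
        "required": True,
        "description": "The prompt or prompts to guide image generation.",
    },
    "height": {
        "type_hint": int,
        "description": "The height in pixels of the generated image.",
    },
}


@dataclass
class InputParam:
    name: Optional[str] = None
    type_hint: Any = None
    required: bool = False
    default: Any = None
    description: Optional[str] = None

    def __post_init__(self):
        template = INPUT_PARAM_TEMPLATES.get(self.name)
        if template is None:
            return  # unknown or unnamed inputs are left untouched
        # Fill in only the fields the caller did not set explicitly.
        if self.type_hint is None:
            self.type_hint = template.get("type_hint")
        if self.description is None:
            self.description = template.get("description")


height = InputParam(name="height", default=1024)                  # filled from the template
custom = InputParam(name="height", description="Output height.")  # explicit value wins
unknown = InputParam(name="my_custom_input")                      # no template, unchanged
```

One thing to note: as written, the "required" entry in a template is never applied, because an explicit required=False cannot be told apart from the field default.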

If we feel that methods for these inputs are necessary, one way to address it without adding individual methods to the InputParam is to use a metaclass. It would result in the InputParam object being less crowded.

class InputParamMeta(type):
    def __getattr__(cls, name: str):
        if name in INPUT_PARAM_TEMPLATES:

            def factory(**overrides):
                return cls(name=name, **overrides)

            return factory
        raise AttributeError(f"No template named '{name}'")


@dataclass
class InputParam(metaclass=InputParamMeta):
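
A runnable sketch of the metaclass variant, assuming a trimmed-down registry and a simplified InputParam. Unlike the snippet above, the factory here merges the template values itself so the example does not depend on __post_init__. That merge is an assumption of this sketch, not part of the proposal.

```python
from dataclasses import dataclass
from typing import Any, Optional

# Hypothetical registry, trimmed for the example.
INPUT_PARAM_TEMPLATES = {
    "prompt": {
        "type_hint": str,
        "required": True,
        "description": "The prompt or prompts to guide image generation.",
    },
    "height": {
        "type_hint": int,
        "description": "The height in pixels of the generated image.",
    },
}


class InputParamMeta(type):
    def __getattr__(cls, name: str):
        # Reached only when normal attribute lookup on the class fails.
        if name in INPUT_PARAM_TEMPLATES:

            def factory(**overrides):
                # Template values first, caller overrides last.
                return cls(name=name, **{**INPUT_PARAM_TEMPLATES[name], **overrides})

            return factory
        raise AttributeError(f"No template named '{name}'")


@dataclass
class InputParam(metaclass=InputParamMeta):
    name: Optional[str] = None
    type_hint: Any = None
    required: bool = False
    default: Any = None
    description: Optional[str] = None


prompt = InputParam.prompt()
height = InputParam.height(default=1024)
```

Because the metaclass `__getattr__` only runs when normal lookup fails, a template whose key matches a dataclass field with a default (for example "name" or "default") would resolve to that class attribute instead of a factory.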

yiyixuxu · 2026-01-10T11:32:59Z