Expose multimask_output in EfficientSam.forward #30

Open
MichaelFishmanBDAII opened this issue Dec 19, 2023 · 3 comments

MichaelFishmanBDAII commented Dec 19, 2023

This would involve adding the optional parameter to EfficientSam.forward, and then passing it to EfficientSam.predict_masks.

    def forward(
        self,
        batched_images: torch.Tensor,
        batched_points: torch.Tensor,
        batched_point_labels: torch.Tensor,
        scale_to_original_image_size: bool = True,
        multimask_output: bool = True
    ) -> Tuple[torch.Tensor, torch.Tensor]:
        """
        Predicts masks end-to-end from provided images and prompts.
        If prompts are not known in advance, using SamPredictor is
        recommended over calling the model directly.

        Arguments:
          batched_images: A tensor of shape [B, 3, H, W]
          batched_points: A tensor of shape [B, num_queries, max_num_pts, 2]
          batched_point_labels: A tensor of shape [B, num_queries, max_num_pts]
          scale_to_original_image_size: If True, upscale the output masks to the
            original image size. Otherwise, return low-resolution masks.
          multimask_output: If True, generate multiple masks for each query.
            Otherwise, generate one mask per query.

        Returns:
          A tuple of two tensors, where the ith element is obtained by considering
          the first i+1 points:
            low_res_mask: A tensor of shape [B, 256, 256] of predicted masks
            iou_predictions: A tensor of shape [B, max_num_queries] of estimated IOU scores
        """
        batch_size, _, input_h, input_w = batched_images.shape
        image_embeddings = self.get_image_embeddings(batched_images)
        return self.predict_masks(
            image_embeddings,
            batched_points,
            batched_point_labels,
            multimask_output=multimask_output,
            input_h=input_h,
            input_w=input_w,
            output_h=input_h if scale_to_original_image_size else -1,
            output_w=input_w if scale_to_original_image_size else -1,
        )
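
For reference, here's a minimal sketch of how this patched forward could be called with the new flag. The builder name (`build_efficient_sam_vitt`), the example image path, and the point coordinates are only illustrative, taken loosely from the repo's README/demo, so treat them as assumptions rather than exact API:

    # Sketch only: build_efficient_sam_vitt, the image path, and the prompt
    # coordinates below are assumptions based on the repo's example code.
    import torch
    from PIL import Image
    from torchvision.transforms import ToTensor

    from efficient_sam.build_efficient_sam import build_efficient_sam_vitt

    model = build_efficient_sam_vitt()
    model.eval()

    image = ToTensor()(Image.open("figs/examples/dogs.jpg"))[None]  # [1, 3, H, W]
    points = torch.tensor([[[[580.0, 350.0], [650.0, 350.0]]]])     # [1, 1, 2, 2]
    labels = torch.tensor([[[1, 1]]])                                # [1, 1, 2], 1 = foreground

    with torch.no_grad():
        masks, iou_predictions = model(
            image, points, labels, multimask_output=False
        )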

I'm happy to make a PR for this, but I figure it may be easier to just throw this in as part of the ongoing updates you all are making.

Thanks for releasing this code and updating it so frequently!

Edit: I tried the code above, and using multimask_output=False seems to be giving me broken masks, so I'm probably missing something and this may be more involved than I'd thought. The bug could also be in my postprocessing code.

For the dog example image and points, this is what I get with, and without multimask:

with multimask: [screenshot]

without multimask: [screenshot]

yformer (Owner) commented Dec 20, 2023

@MichaelFishmanBDAII, thanks for your interest! We will take a look.

MichaelFishmanBDAII (Author) commented:

The predicted IOU for the single-mask mode masks tends to be very low (1e-5), so I don't think the problem is how I'm unpacking the masks.

I also tried using bounding box prompts instead of point prompts, since the original SAM paper says single-mask mode was designed for multi-prompt mask generation, but this still gave me the same diffuse, grid-like masks as the point prompts.
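
Until single-mask mode works, a possible interim workaround is to keep multimask_output=True and select the highest-scoring mask per query. This is only a sketch, continuing from the example above: the [B, num_queries, num_masks, ...] output layout and the torch.take_along_dim pattern are assumptions based on how the repo's example notebook unpacks the outputs.

    # Sketch of a workaround: run with multimask_output=True (the default) and
    # keep only the mask with the highest predicted IoU for each query.
    # Assumes masks: [B, num_queries, num_masks, H, W], iou: [B, num_queries, num_masks].
    masks, iou_predictions = model(image, points, labels)

    sorted_ids = torch.argsort(iou_predictions, dim=-1, descending=True)
    iou_sorted = torch.take_along_dim(iou_predictions, sorted_ids, dim=2)
    masks_sorted = torch.take_along_dim(masks, sorted_ids[..., None, None], dim=2)

    best_mask = masks_sorted[:, :, 0] >= 0  # threshold the top-ranked logits at 0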

feivellau commented:

I'm experiencing the same problem. When I set multimask_output=False to get a single mask, the mask results are very poor!
