MaGGIe introduces the I-HIM50K and M-HIM2K datasets, featuring over 180,000 synthesized human masks to evaluate instance matting robustness.MaGGIe introduces the I-HIM50K and M-HIM2K datasets, featuring over 180,000 synthesized human masks to evaluate instance matting robustness.

Synthesizing Multi-Instance Human Matting Data with MaskRCNN and BG20K

2025/12/20 02:30
3 min read
For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

Abstract and 1. Introduction

  1. Related Works

  2. MaGGIe

    3.1. Efficient Masked Guided Instance Matting

    3.2. Feature-Matte Temporal Consistency

  3. Instance Matting Datasets

    4.1. Image Instance Matting and 4.2. Video Instance Matting

  4. Experiments

    5.1. Pre-training on image data

    5.2. Training on video data

  5. Discussion and References

\ Supplementary Material

  1. Architecture details

  2. Image matting

    8.1. Dataset generation and preparation

    8.2. Training details

    8.3. Quantitative details

    8.4. More qualitative results on natural images

  3. Video matting

    9.1. Dataset generation

    9.2. Training details

    9.3. Quantitative details

    9.4. More qualitative results

8. Image matting

This section expands on the image matting process, providing additional insights into dataset generation and comprehensive comparisons with existing methods. We delve into the creation of I-HIM50K and M-HIM2K datasets, offer detailed quantitative analyses, and present further qualitative results to underscore the effectiveness of our approach.

8.1. Dataset generation and preparation

The I-HIM50K dataset was synthesized from the HHM50K [50] dataset, which is known for its extensive collection of human image mattes. We employed a MaskRCNN [14] Resnet-50 FPN 3x model, trained on the COCO dataset, to filter out single-person images, resulting in a subset of 35,053 images. Following the InstMatt [49] methodology, these images were composited against diverse backgrounds from the BG20K [29] dataset, creating multi-instance scenarios with 2-5 subjects per image. The subjects were resized and positioned to maintain a realistic scale and avoid excessive overlap, as indicated by instance IoUs not exceeding 30%. This process yielded 49,737 images, averaging 2.28 instances per image. During training, guidance masks were generated by binarizing the alpha mattes and applying random dropout, dilation, and erosion operations. Sample images from I-HIM50K are displayed in Fig. 10.

\ The M-HIM2K dataset was designed to test model robustness against varying mask qualities. It comprises ten masks per instance, generated using various MaskRCNN models. More information about models used for this generation process is shown in Table 8. The masks were matched to instances based on the highest IoU with the ground truth alpha mattes, ensuring a minimum IoU threshold of 70%. Masks that did not meet this threshold were artificially generated from ground truth. This process resulted in a comprehensive set of 134,240 masks, with 117,660 for composite and 16,600 for natural images, providing a robust benchmark for evaluating masked guided instance matting. The full dataset I-HIM50K and M-HIM2K will be released after the acceptance of this work.

\ Figure 10. Examples of I-HIM50K dataset. (Best viewed in color).

\ Table 8. Ten models with vary mask quality are used in MHIM2K. The MaskRCNN models are from detectron2 trained on COCO with different settings.

\

:::info Authors:

(1) Chuong Huynh, University of Maryland, College Park (chuonghm@cs.umd.edu);

(2) Seoung Wug Oh, Adobe Research (seoh,jolee@adobe.com);

(3) Abhinav Shrivastava, University of Maryland, College Park (abhinav@cs.umd.edu);

(4) Joon-Young Lee, Adobe Research (jolee@adobe.com).

:::


:::info This paper is available on arxiv under CC by 4.0 Deed (Attribution 4.0 International) license.

:::

\

Market Opportunity
MemeCore Logo
MemeCore Price(M)
$1.51193
$1.51193$1.51193
+1.08%
USD
MemeCore (M) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Bitcoin ETFs Surge with 20,685 BTC Inflows, Marking Strongest Week

Bitcoin ETFs Surge with 20,685 BTC Inflows, Marking Strongest Week

TLDR Bitcoin ETFs recorded their strongest weekly inflows since July, reaching 20,685 BTC. U.S. Bitcoin ETFs contributed nearly 97% of the total inflows last week. The surge in Bitcoin ETF inflows pushed holdings to a new high of 1.32 million BTC. Fidelity’s FBTC product accounted for 36% of the total inflows, marking an 18-month high. [...] The post Bitcoin ETFs Surge with 20,685 BTC Inflows, Marking Strongest Week appeared first on CoinCentral.
Share
Coincentral2025/09/18 02:30
Today’s NYT Pips Hints And Solutions For Thursday, September 18th

Today’s NYT Pips Hints And Solutions For Thursday, September 18th

The post Today’s NYT Pips Hints And Solutions For Thursday, September 18th appeared on BitcoinEthereumNews.com. It’s Thursday and I am incredibly sore and tired after really hitting the weights and the yoga mat hard this week. Sore is good! It takes pain to reduce pain, or at least that’s my experience with exercise. We must exercise our minds as well, and what better way to do that than with a fun puzzle game about placing dominoes in the correct tiles. Come along, my Pipsqueaks, let’s solve today’s Pips! Looking for Wednesday’s Pips? Read our guide right here. How To Play Pips In Pips, you have a grid of multicolored boxes. Each colored area represents a different “condition” that you have to achieve. You have a select number of dominoes that you have to spend filling in the grid. You must use every domino and achieve every condition properly to win. There are Easy, Medium and Difficult tiers. Here’s an example of a difficult tier Pips: Pips example Screenshot: Erik Kain As you can see, the grid has a bunch of symbols and numbers with each color. On the far left, the three purple squares must not equal one another (hence the equal sign crossed out). The two pink squares next to that must equal a total of 0. The zig-zagging blue squares all must equal one another. You click on dominoes to rotate them, and will need to since they have to be rotated to fit where they belong. Not shown on this grid are other conditions, such as “less than” or “greater than.” If there are multiple tiles with > or < signs, the total of those tiles must be greater or less than the listed number. It varies by grid. Blank spaces can have anything. The various possible conditions are: = All pips must equal one another in this group. ≠ All pips…
Share
BitcoinEthereumNews2025/09/18 08:59
Vitalik Buterin to Ethereum Developers: Build It Like It Has to Last Without You

Vitalik Buterin to Ethereum Developers: Build It Like It Has to Last Without You

Key Takeaways Vitalik Buterin wants Ethereum apps built to survive without developers, corporate servers, or trusted third parties Two major […] The post Vitalik
Share
Coindoo2026/03/07 15:49