This article details the selection and implementation of classic and SOTA incremental learning methods for benchmarking the new Instance-Incremental Learning setting.This article details the selection and implementation of classic and SOTA incremental learning methods for benchmarking the new Instance-Incremental Learning setting.

CIL Methods Fail IIL: Why Existing Baselines Struggle with Model Promotion

2025/11/12 23:00
5 min read
For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

Abstract and 1 Introduction

  1. Related works

  2. Problem setting

  3. Methodology

    4.1. Decision boundary-aware distillation

    4.2. Knowledge consolidation

  4. Experimental results and 5.1. Experiment Setup

    5.2. Comparison with SOTA methods

    5.3. Ablation study

  5. Conclusion and future work and References

    \

Supplementary Material

  1. Details of the theoretical analysis on KCEMA mechanism in IIL
  2. Algorithm overview
  3. Dataset details
  4. Implementation details
  5. Visualization of dusted input images
  6. More experimental results

10. Implementation details

\ Selection of compared methods. As few existing method is proposed for the IIL setting, we reproduce several classic and SOTA CIL methods by referring to their original code or paper with the minimum revision.

\ LwF [12] is one of the earliest incremental learning algorithms based on deep learning, which propose to use knowledge distillation for resisting knowledge forgetting. Considering the significance of this method, a lot of CIL methods are still comparing with this baseline.

\ iCarl [22] is base on the LwF method and propose to use old exemplars for label-level knowledge distillation.

\ PODNet [4] implements old knowledge distillation at the feature level, which is different from the former two.

\ Der [31] which expends the network dynamically and attains the best CIL results given task id. Expending the neural network shows great power in learning new knowledge and retain old knowledge. Although in new IIL setting the adding of new parameters is limited, we are glad to know the performance of the method with dynamic network in the new IIL setting.

\ OnPro [29] uses online prototypes to enhance the existing boundaries with only the data visible at the current time step, which satisfies our setting without old data. Their motion to make the learned feature more generalizable to new tasks is also consist with our method to promote the model continually utilizing only new data.

\ online learning [6] can be applied to the hybrid-incremental learning. Thus, it can be implemented directly in the new IIL setting. It proposes a modified cross-distillation by smoothing student predictions with teacher predictions for old knowledge retaining, which is different with our method to alter the learning target by fusing annotated label with the teacher predictions.

\ Besides above CIL methods, ISL [13] is one of the scarce IIL methods that can be directly implemented in our IIL setting. Different from our setting that aims to address all newly achieved instances, ISL is proposed for incremental sub-population learning. Hence, we not only applied ISL in our setting for comparative purposes but also evaluated ours following their setting.

\ Our extensive experiments real that neither existing CIL methods nor IIL methods can tame the proposed IIL learning problem. Existing methods, especially the CIL methods, primarily concentrate on mitigating catastrophic forgetting, demonstrating limited effectiveness in learning from new data. In real-world applications, enhancing the model with additional instances and achieving more generalizable features is crucial. This underscores the importance and relevance of our proposed IIL setting.

\ Details for reproduction of compared methods. To reproduce existing rehearsal-based methods, including iCarl [22], PODNet [4], Der [31], OnPro [29], although no old data is available in the new IIL setting, we still set a memory of 20 exemplars per class. For all compared methods, we trained the base model for their own if necessary. For example, some of the methods utilize more comprehensive data augmentation methods and have a higher base performance than others. We kept the data augmentation part in reproducing these methods, even in the IIL learning phase.

\ For LwF [12], as there is no need to add new classification head, we directly implement their training loss on the old classification head. iCarl [22] is implemented using their provided codes on Tensorflow. PODNet [4] and Der [31] are implemented based on their own codes using Pytorch. For online learning [6], we reproduce it only using the part that is related to learning from new instances of old classes in the first training phase. ISL [13] is also implemented based on its original codes with the original setting of the learning rate, i.e. lr=0.05 for the base training and lr=0.005 for the IIL learning phase. OnPro [29] utilize a quite strong data augmentation during training, which leads to a slightly low accuracy of the based model compared to other methods. As OnPro trains the base model for

\ Figure 10. The original images and corresponding dusted images.

\ only one epoch in the online training, which is not sufficient to achieve a strong base model, we change it to 60 epochs for training the base model as other methods. For the IIL learning phase, we keep the one epoch setting as it is because we found training more epochs don’t lead to better performance. The learning rate of OnPro used in training the based model is 5e-4 and decays with a ratio of 0.1 at epoch 12, 30, 50. In the IIL traing phase, lr=5e-7, which is far smaller than ours (0.01).

\

:::info Authors:

(1) Qiang Nie, Hong Kong University of Science and Technology (Guangzhou);

(2) Weifu Fu, Tencent Youtu Lab;

(3) Yuhuan Lin, Tencent Youtu Lab;

(4) Jialin Li, Tencent Youtu Lab;

(5) Yifeng Zhou, Tencent Youtu Lab;

(6) Yong Liu, Tencent Youtu Lab;

(7) Qiang Nie, Hong Kong University of Science and Technology (Guangzhou);

(8) Chengjie Wang, Tencent Youtu Lab.

:::


:::info This paper is available on arxiv under CC BY-NC-ND 4.0 Deed (Attribution-Noncommercial-Noderivs 4.0 International) license.

:::

\

World Cup Combo: Aim for 200x

World Cup Combo: Aim for 200xWorld Cup Combo: Aim for 200x

Combine up to 20 World Cup matches in one order

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Q2 Market Insights: Bitcoin regains dominance in risk-averse environment, ETFs remain critical to market structure

Q2 Market Insights: Bitcoin regains dominance in risk-averse environment, ETFs remain critical to market structure

The market will show a downward trend in the short term, and then rebound and set new highs in the second half of the year.
Share
PANews2025/04/28 19:40
Brittany Force Reflects On A NHRA Career At Full Throttle And What Comes Next

Brittany Force Reflects On A NHRA Career At Full Throttle And What Comes Next

The post Brittany Force Reflects On A NHRA Career At Full Throttle And What Comes Next appeared on BitcoinEthereumNews.com. GAINESVILLE, FL – MARCH 11: Brittany Force (#1 Monster Energy Top Fuel Dragster) watches her crew prepare her dragster during qualfying for the AMALIE Motor Oil NHRA Gatornationals on March 11, 2023 at Gainesville Raceway in Gainesville, FL. (Photo by Jeff Robinson/Icon Sportswire via Getty Images) Icon Sportswire via Getty Images When Brittany Force commits to something, she floors it. After all you don’t become the fastest woman in NHRA history by easing into the throttle. For 13 of her 39 years, she’s strapped into a Top Fuel dragster and launched herself down quarter-mile dragstrips at over 330 miles an hour. That’s not a career you do halfway. But now, after two NHRA championships, a cabinet full of Wallys, record-setting runs, and a lifetime’s worth of moments, she’s turning the page to start a new chapter. That news isn’t new — Force announced that she would step away to start a family with her husband, Bobby, a few weeks ago. But what’s worth pausing on now isn’t the headline, it’s the meaning behind it. Force isn’t retiring so much as shifting gears, and taking time to reflect on a career she says has already given her more than she ever dreamed. “If I were to rewind back to 2013, my rookie season, I think I’ve accomplished more than I ever could have imagined,” Force says. “My big focus (then) was I just want to get a win. I wanted to stand in a winner’s circle with my team.” That focus became her eyes on the biggest prize, an NHRA Top Fuel title. Something her legendary father accomplished an astounding 16 times. Brittany earned not just one, but two NHRA Top Fuel titles. “I never imagined two championships ever,” she says. “And when that first one came, that’s really when it…
Share
BitcoinEthereumNews2025/09/21 23:37
Why Businesses Need Professional Machine Design and Development Services

Why Businesses Need Professional Machine Design and Development Services

In many industries, machines are the backbone of daily work. They help businesses speed up production, improve accuracy, and reduce manual effort. But building
Share
Techbullion2026/04/02 17:54