🏆Won 3rd Position at Distributed Hack Berlin by @exalsius and @flwrlabs on 14th - 15th November at Einstein Digital Center, Berlin. I have provided all the details about the model built below in the thread 🧵
We worked on the Track 1 of the Hackathon which was to train the model on the NIH Lung Dataset which is been distributed around three hospital in the scenario and the model needs to be trained particularly to that hospital and then have the aggregated model by using the federated learning approach using flowerlabs. The major constraint factor was of that each job of training will run only for 20 mins.
I built the EfficientNet-b0 with the pretrained network on the ImageNet 1K, as the focus point was to get the high accuracy in the less duration of the training time, the approaches used for building the model was Learning rate scheduler of 15% linear warmup + cosine decay (min_lr=0.05). The starting with a small lr so training doesn’t explode early then slowly reduce lr so the model trains gently at the end as per the cosine decay. This approach stabilize the early training and prevents the swings in the lr that could destabilize the tuning.
I did the approach of preserving the pretrained filers structures while adapting the single channel input. Also, made sure of the fast learning of the new head with the careful bias bound. The major thing which was to handle the catastrophic forgetting so I assigned the large lr for the new classifier and small lr for the pretrained backbone. Also, Configured the Autocast and the GradScalar which eventually helped to halve the memory and double the throughput. Mixed precision training (FP16) with GradScaler was used. To also maintain the stability and prevent from the exploiding gradients, the approach applied was to have the stronger clipping during the warmup and then hte lower clipping later on. I did consider that most of the medical records are in imbalance in the nature so I applied the Focal Loss (gamma=2.0, alpha=1.0) with class weighting.
‏‎7.91 ألف‏
‏‎16‏
المحتوى الوارد في هذه الصفحة مُقدَّم من أطراف ثالثة. وما لم يُذكَر خلاف ذلك، فإن OKX ليست مُؤلِّفة المقالة (المقالات) المذكورة ولا تُطالِب بأي حقوق نشر وتأليف للمواد. المحتوى مٌقدَّم لأغراض إعلامية ولا يُمثِّل آراء OKX، وليس الغرض منه أن يكون تأييدًا من أي نوع، ولا يجب اعتباره مشورة استثمارية أو التماسًا لشراء الأصول الرقمية أو بيعها. إلى الحد الذي يُستخدَم فيه الذكاء الاصطناعي التوليدي لتقديم مُلخصَّات أو معلومات أخرى، قد يكون هذا المحتوى الناتج عن الذكاء الاصطناعي غير دقيق أو غير مُتسِق. من فضلك اقرأ المقالة ذات الصِلة بهذا الشأن لمزيدٍ من التفاصيل والمعلومات. OKX ليست مسؤولة عن المحتوى الوارد في مواقع الأطراف الثالثة. والاحتفاظ بالأصول الرقمية، بما في ذلك العملات المستقرة ورموز NFT، فيه درجة عالية من المخاطر وهو عُرضة للتقلُّب الشديد. وعليك التفكير جيِّدًا فيما إذا كان تداوُل الأصول الرقمية أو الاحتفاظ بها مناسبًا لك في ظل ظروفك المالية.