weekly useful materials -06/22-

GENZITSU / UsefulMaterials

34 stars 0 forks source link

weekly useful materials -06/22- #57

Open GENZITSU opened 3 years ago

GENZITSU commented 3 years ago

dockerを介すとアプリのパーフォマンスが下がる事例の紹介

Docker のパフォーマンス劣化は実測で

1CPUの場合-27%、Security OFFで-20% 2CPUの場合-28%、Security OFFで-23%

このパフォーマンス比較を詳しく見ると ①ファイル読み書き=-47% ②Shell処理(CPU性能とか表す)=-45% ③プロセス生成=-51%←これがDockerのSecurityをOFFにすると-21%と大幅改善と全体より気になるマイナス幅にも気づきます

感想

機械学習エンジニア的に気になるのはファイルの読み書きのところ。ファイルの読み書き性能が低いとあるが、これはホスト側のファイルを読み取るときに遅いんだろうか？
(バインドマウントとボリュームマウントでも違いが出そう :thinking: )

あんまり遅いと学習のボトルネックになりそうなので、素インスタンス上で環境構築したほうがいいのかなぁ？

出典

元ツイート

検証記事

GENZITSU commented 3 years ago

新しい Detectron2 Mask R-CNN baseline from FAIR

Simple Copy-Paste Data Augmentationの再現実験を通じて、物体検出の精度向上をもたらすtipsを発見。

We conducted a series of ablation experiments to understand which hyperparameter changes drove these improvements. To see whether we can drive accuracy even higher, we also tried deeper models with larger images. Our experiments demonstrated that:

長い時間をかけてトレーニングを実施し、image sizeを大きくすることでBox AP, MaskAPは向上するが、scale jitterは大きくしすぎてもだめ。

A longer training schedule, larger input image size, and a larger scale jitter range have positive effects on AP. Box AP and Mask AP continued to scale with increases in training schedule (as shown in the chart above). Box AP and Mask AP plateaued for scale jitter at 0.5–1.6 when trained with 144 epoch schedule (as shown in the chart below).

Sync Batch Norm, weight Decay, DeepなBox headは共に精度向上に寄与する

Sync Batch Norm, Weight Decay, and deeper Region Proposal Network (RPN) and Region of Interest (ROI) heads also have a positive impact on Box AP and Mask AP, as shown in the table below.

AMPは学習速度を30%高速化させるが、degradeは限定的

Enabling PyTorch’s automatic mixed precision (AMP) and FP16 improved training speed by 30 percent and does not degrade Box AP and Mask AP. These performance gains were on an eight-node cluster, where each node had eight Nvidia V100 32GB GPUs.

感想

ちょろっと書かれ血えるが、ImageNet Initializationよりもrandom Initializationの方が精度が高いのが驚き。 Copy-Pasteで生成されるクリーンなラベルの影響だろうか...?

諸々のテクは機会があれば使いたい。

GENZITSU / UsefulMaterials