SAR Target Classification Based on full convolution network

SAR Target Classification Based on full convolution network

1. MSTAR dataset expansion

  • In the original SAR image (128) × 128) random clipping 88 × 88 slices, each slice can contain the target area, and the number of samples can reach (128-88 + 1) after random sampling × (128-88 + 1) = 1681 times, each SAR image of each category samples 10 slices, so as to improve the number of training samples.

2. Full convolution network model (AConvNets)

  • The network is completely realized by the convolution layer, the full connection layer is removed, the convolution output of the last layer directly outputs the probability of each category by Softmax, and the loss function is still the loss of classification cross entropy. The specific network model is as follows:

  • Among them, the convolution layer uses the BN layer to prevent over fitting and accelerate training. The input training set dimension: [b, 88, 88, 1] - > output: [b, 1, 1, 10]. The classification cross entropy loss is still used for training. The optimizer Adam has a learning rate of 0.0001.

3. Classification results under SOC conditions

  • The training set and test set are as follows:

    categoryquantity
    Training set 17 °Test set 15 °
    2S1299274
    BMP2233196
    BRDM2298274
    BTR60256195
    BTR70233196
    D7299274
    T62299273
    T72232196
    ZIL131299274
    ZSU23/4299274
    total27472426
  • Each SAR image in the training set is cropped with 5 random slices and 1 slice in the center, all of which are 88 × 88 pixels, a total of 2747 training sets × 6 = 16482 (sheets). Test set center clipping 88 × 88 slices.

  • Training: epoch = 50, Adam optimizer, learning_rate = 0.0001, the Accuracy and Loss curves of training set and test set are displayed in the tensorboard:

  • Calculate the confusion matrix of the test set as follows:

     	 2S1 BMP2 BRDM B60 B70 D7 T62 T72 ZIL ZSU	
    2S1[[263   0   0   0   0   0  11   0   0   0]	95.99%
    BMP2[  0 132   0  18   3   0   0  42   0   1]	67.35%
    BRDM[  0   0 272   0   0   0   0   0   0   2]	99.27%
    B60 [  0   0   0 192   0   0   0   0   1   2]	98.46%
    B70 [  0   0   0  13 181   0   0   2   0   0]	92.35%
    D7 	[  2   0   0   0   0 272   0   0   0   0]	99.27%
    T62 [  0   0   0   0   0   0 273   0   0   0]	100%
    T72 [  0   0   0   1   0   0   0 195   0   0]	99.49%
    ZIL [  0   0   0   0   0   4   0   0 270   0]	98.54%
    ZSU [  0   0   0   0   0   0   0   0   0 274]]	100%
    Total: 95.07% 
    The Precision is :  0.9574354589252246
    The Recall is :  0.9507109053615832
    The Accuracy is :  0.9579554822753503
    The F1 is :  0.9493873742571258
    The F_beta is :  0.949223632328047
    The Auc Score is :  0.9996171502953114
    

4. Classification results under eoc1

  • EOC1 is the training set and test set under the condition of large elevation change, as follows:

    categoryquantityquantity
    Training set 17 °Test set 30 °
    2S1299288
    BRDM2298287
    T72299288
    ZSU234299288
    total11951151
  • Each SAR image in the training set is cropped with 5 random slices and 1 slice in the center, all of which are 88 × 88 pixels, 1195 training sets in total × 6 = 7170 (sheets). Test set center clipping 88 × 88 slices.

  • Training: epoch200, Adam optimizer, learning_rate = 0.0001, the Accuracy and Loss curves of training set and test set are displayed in the tensorboard:
    [external chain picture transfer failed. The source station may have anti-theft chain mechanism. It is recommended to save the picture and upload it directly (img)-

  • Calculate the confusion matrix of the test set as follows:

         2S1  BRDM  T72  ZSU	
    2S1   288   0    0    0		100%
    BRDM   0   287   0    0		100%
    T72    0    0   287   1		99.65%
    ZSU   59    0    0   229	79.51%
    Total: 94.79% 
    
    The CNN Precision is :  0.9564058388673098
    The CNN Recall is :  0.9479166666666666
    The CNN Accuracy is :  0.947871416159861
    The CNN F1 is :  0.9473793419770824
    The CNN F_beta is :  0.9465925313694183
    

5. Classification results under eoc2 conditions

  • EOC2 is the change of vehicle appearance configuration, and its training set is as follows:

    categoryQuantity (pitch angle 17 °)
    BMP2(9563)233
    BRDM2298
    BTR70(C71)233
    T72(132)232
    total995
  • Test set:

    categorymodelQuantity (pitch angle 17 ° & 15 °)
    T72S7419
    T72A32572
    T72A62573
    T72A63573
    T72A64573
    total2710
  • Each SAR image in the training set is cropped with 10 random slices and 1 slice in the center, all of which are 88 × 88 pixels, 995 training sets in total × 11 = 10945 (sheets). Test set center clipping 88 × 88 slices.

  • Training: epoch50, Adam optimizer, learning_rate = 0.0001, the Accuracy and Loss curves of training set and test set are displayed in the tensorboard:

  • Calculate the confusion matrix of the test set as follows:

    	BMP2  BRDM2  BTR70   T72  
    S7	 8		0	   0     411	98.09%
    A32	 0      0      0     572	100%
    A62  1      0      0     572	99.83%
    A63  1      0      0     572	99.83%
    A64  4      0      0     569	96.14%
    Total: 98.78%
    

6. Classification results under eoc3

  • EOC3 is for different models and variants of the same target. The training set is consistent with EOC2 training set. The test set is as follows:

    categorymodelQuantity (pitch angle 17 ° & 15 °)
    BMP29566428
    BMP2C21429
    T72812426
    T72A04573
    T72A05573
    T72A07573
    T72A10567
    total3569
  • Here, the EOC2 trained model is directly used for prediction. The confusion matrix of the test set is as follows:

    		   BMP2   BRDM2    BTR70   T72  
    BMP9566	   359		0		17	    52		83.88%
    BMPC21	   372		0		1		2		86.71%
    T72_812		2		0		1	   423		99.30%
    T72_A04		3		0		0	   570		99.48%
    T72_A05		1		0		0	   572		99.83%
    T72_A07		0		0		0	   573		100%
    T72_A10		20		0		0	   547		96.47%		
    Total: 95.74%
    

Keywords: Computer Vision Deep Learning convolution Object Detection

Added by salih0vicX on Mon, 20 Dec 2021 07:29:15 +0200