SAR Target Classification Based on full convolution network

1. MSTAR dataset expansion

In the original SAR image (128) × 128) random clipping 88 × 88 slices, each slice can contain the target area, and the number of samples can reach (128-88 + 1) after random sampling × (128-88 + 1) = 1681 times, each SAR image of each category samples 10 slices, so as to improve the number of training samples.

2. Full convolution network model (AConvNets)

The network is completely realized by the convolution layer, the full connection layer is removed, the convolution output of the last layer directly outputs the probability of each category by Softmax, and the loss function is still the loss of classification cross entropy. The specific network model is as follows:

Among them, the convolution layer uses the BN layer to prevent over fitting and accelerate training. The input training set dimension: [b, 88, 88, 1] - > output: [b, 1, 1, 10]. The classification cross entropy loss is still used for training. The optimizer Adam has a learning rate of 0.0001.

3. Classification results under SOC conditions

The training set and test set are as follows:

category quantity
Training set 17 ° Test set 15 °
2S1 299 274
BMP2 233 196
BRDM2 298 274
BTR60 256 195
BTR70 233 196
D7 299 274
T62 299 273
T72 232 196
ZIL131 299 274
ZSU23/4 299 274
total 2747 2426
Each SAR image in the training set is cropped with 5 random slices and 1 slice in the center, all of which are 88 × 88 pixels, a total of 2747 training sets × 6 = 16482 (sheets). Test set center clipping 88 × 88 slices.
Training: epoch = 50, Adam optimizer, learning_rate = 0.0001, the Accuracy and Loss curves of training set and test set are displayed in the tensorboard:

category	quantity
	Training set 17 °	Test set 15 °
2S1	299	274
BMP2	233	196
BRDM2	298	274
BTR60	256	195
BTR70	233	196
D7	299	274
T62	299	273
T72	232	196
ZIL131	299	274
ZSU23/4	299	274
total	2747	2426

Calculate the confusion matrix of the test set as follows:

 	 2S1 BMP2 BRDM B60 B70 D7 T62 T72 ZIL ZSU	
2S1[[263   0   0   0   0   0  11   0   0   0]	95.99%
BMP2[  0 132   0  18   3   0   0  42   0   1]	67.35%
BRDM[  0   0 272   0   0   0   0   0   0   2]	99.27%
B60 [  0   0   0 192   0   0   0   0   1   2]	98.46%
B70 [  0   0   0  13 181   0   0   2   0   0]	92.35%
D7 	[  2   0   0   0   0 272   0   0   0   0]	99.27%
T62 [  0   0   0   0   0   0 273   0   0   0]	100%
T72 [  0   0   0   1   0   0   0 195   0   0]	99.49%
ZIL [  0   0   0   0   0   4   0   0 270   0]	98.54%
ZSU [  0   0   0   0   0   0   0   0   0 274]]	100%
Total: 95.07% 
The Precision is :  0.9574354589252246
The Recall is :  0.9507109053615832
The Accuracy is :  0.9579554822753503
The F1 is :  0.9493873742571258
The F_beta is :  0.949223632328047
The Auc Score is :  0.9996171502953114

4. Classification results under eoc1

EOC1 is the training set and test set under the condition of large elevation change, as follows:

category quantity quantity
Training set 17 ° Test set 30 °
2S1 299 288
BRDM2 298 287
T72 299 288
ZSU234 299 288
total 1195 1151
Each SAR image in the training set is cropped with 5 random slices and 1 slice in the center, all of which are 88 × 88 pixels, 1195 training sets in total × 6 = 7170 (sheets). Test set center clipping 88 × 88 slices.
Training: epoch200, Adam optimizer, learning_rate = 0.0001, the Accuracy and Loss curves of training set and test set are displayed in the tensorboard:
[external chain picture transfer failed. The source station may have anti-theft chain mechanism. It is recommended to save the picture and upload it directly (img)-

category	quantity	quantity
	Training set 17 °	Test set 30 °
2S1	299	288
BRDM2	298	287
T72	299	288
ZSU234	299	288
total	1195	1151

Calculate the confusion matrix of the test set as follows:

     2S1  BRDM  T72  ZSU	
2S1   288   0    0    0		100%
BRDM   0   287   0    0		100%
T72    0    0   287   1		99.65%
ZSU   59    0    0   229	79.51%
Total: 94.79% 

The CNN Precision is :  0.9564058388673098
The CNN Recall is :  0.9479166666666666
The CNN Accuracy is :  0.947871416159861
The CNN F1 is :  0.9473793419770824
The CNN F_beta is :  0.9465925313694183

5. Classification results under eoc2 conditions

EOC2 is the change of vehicle appearance configuration, and its training set is as follows:

category Quantity (pitch angle 17 °)
BMP2(9563) 233
BRDM2 298
BTR70(C71) 233
T72(132) 232
total 995
Test set:

category model Quantity (pitch angle 17 ° & 15 °)
T72 S7 419
T72 A32 572
T72 A62 573
T72 A63 573
T72 A64 573
total 2710
Each SAR image in the training set is cropped with 10 random slices and 1 slice in the center, all of which are 88 × 88 pixels, 995 training sets in total × 11 = 10945 (sheets). Test set center clipping 88 × 88 slices.
Training: epoch50, Adam optimizer, learning_rate = 0.0001, the Accuracy and Loss curves of training set and test set are displayed in the tensorboard:

category	Quantity (pitch angle 17 °)
BMP2(9563)	233
BRDM2	298
BTR70(C71)	233
T72(132)	232
total	995

category	model	Quantity (pitch angle 17 ° & 15 °)
T72	S7	419
T72	A32	572
T72	A62	573
T72	A63	573
T72	A64	573
total		2710

Calculate the confusion matrix of the test set as follows:

	BMP2  BRDM2  BTR70   T72  
S7	 8		0	   0     411	98.09%
A32	 0      0      0     572	100%
A62  1      0      0     572	99.83%
A63  1      0      0     572	99.83%
A64  4      0      0     569	96.14%
Total: 98.78%

6. Classification results under eoc3

EOC3 is for different models and variants of the same target. The training set is consistent with EOC2 training set. The test set is as follows:

category model Quantity (pitch angle 17 ° & 15 °)
BMP2 9566 428
BMP2 C21 429
T72 812 426
T72 A04 573
T72 A05 573
T72 A07 573
T72 A10 567
total 3569

category	model	Quantity (pitch angle 17 ° & 15 °)
BMP2	9566	428
BMP2	C21	429
T72	812	426
T72	A04	573
T72	A05	573
T72	A07	573
T72	A10	567
total		3569

Here, the EOC2 trained model is directly used for prediction. The confusion matrix of the test set is as follows:

		   BMP2   BRDM2    BTR70   T72  
BMP9566	   359		0		17	    52		83.88%
BMPC21	   372		0		1		2		86.71%
T72_812		2		0		1	   423		99.30%
T72_A04		3		0		0	   570		99.48%
T72_A05		1		0		0	   572		99.83%
T72_A07		0		0		0	   573		100%
T72_A10		20		0		0	   547		96.47%		
Total: 95.74%

Keywords: Computer Vision Deep Learning convolution Object Detection

Added by salih0vicX on Mon, 20 Dec 2021 07:29:15 +0200

Programming VIP

SAR Target Classification Based on full convolution network

SAR Target Classification Based on full convolution network

1. MSTAR dataset expansion

2. Full convolution network model (AConvNets)

3. Classification results under SOC conditions

4. Classification results under eoc1

5. Classification results under eoc2 conditions

6. Classification results under eoc3

Popular Keywords