In this paper the multi-clustered modified quadratic discriminant function(MC-MQDF) is used to recognize cars and non-cars images. However the proposed method is capable to learn and classify into various kinds of vehicles: passenger vehicles, commercial vehicles, etc. The MC-MQDF is capable of estimating the complex distribution due to the variety of different possible appearances for preceding vehicles. In a complex distribution test including a variety of vehicles the classification rate for MC-MQDF was approximately 98% while using of the ordinary MQDF resulted in 93% success.
The recognition process can be described as follows:
Learning process.
Recognition Process.
Feature Extraction Feature extraction was done as follows: all images were normalized to 16x16 image. The feature vector was constructed from the grayscale image using the intensities.
Classification.
In MC-MQDF the clustering starts with the dimension parameter , and it is repeated
until the goal dimension by increasing
sequentially. For each dimension, the
k-means clustering is applied to the training samples with a dynamically changing
MQDF metric, which is changed by recomputing of the mean and covariance matrix from the
new members of each cluster. The MC-MQDF model estimation procedure can be described as follows.
Step 1. Obtain initial cluster centroids. (These are chosen at random from training
samples in the paper).
Step 2. Start the clutering with the dimension parameter .
Step 3. Compute the mean and covariance matrix for each cluster from the corresponding cluster members.
Step 4. Increase the dimension by an arbitrary step
Step 5. Re-assign each sample to clusters with the MQDF metric.
Step 6. Re-compute the mean and the covariance matrix for each cluster from its new members.
Step 7. Repeat from Step 5-7 until all cluster members finish changing.
Step 8. If dimension reaches to goal dimension, the MC-MQDF model estimation is completed.
Otherwise return to Step 4.
Database Description.
The database used in this paper had 5000 entries. These images were obtained from videotapes, which were recorded while a vehicle containing an onboard camera drove on highways and on roads in urban areas during daylight hours. The camera was 1/2 CCD monochrome camera with a 16mm-focal length lens, and the captured images consisted of 320x240 pixels with a 256-level gray scale. The evaluation images consist of three types: i) passenger vehicles (sedans, coupes, station wagons and vans); 2) commercial vehicles (buses, trucks, and dump trucks); and 3) motorcycles; In some scenes more then two vehicles were present. The number of motorcycles was very small in comparison to the others. We can see the concept of the proposed method:
Evaluation Method.
In order to reduce the computation time, the evaluations were carried out using test samples consisting of vehicle samples and non-vehicle samples, instead of classifying with a combination of all locations and sizes. Scenes to crop the training samples and the test samples were chosen such that they did not overlap. The training samples for the vehicle were drawn at random from the samples collected. The test samples for the vehicle class were drawn from the remaining images. For the vehicle class, ten training samples were cropped from each vehicle. The test samples were cropped from windows for which approximate locations were given. The training and test samples for the non-vehicle class were cropped at random from the corresponding road scenes, from which the training and test samples for the vehicle were cropped. The sizes of the samples were normalized to 16x16 after cropping the samples from scenes.
Obtained Results.
The maximum classification rate for MC-MQDF for passenger vehicles, commercial vehicles and motorcycles were 99.3%, 97.2%, and 95.6% respectively. The best classification rate for the MC-MQDF was 97.7%.
Kocurek 2007-12-17