AN AUTOENCODER-BASED FEATURE LEVEL FUSION FOR SPEECH EMOTION RECOGNITION