Articles · Open Access

An Overview of Data Science Algorithms

DOI: 10.18535/mcsj/v2020.05· Pages: 36-47· (2020)· Published: May 27, 2020

Abstract

Data science algorithms are becoming an integral part of every company, and the effects are already visible in the many corporations that have built their own data science teams and implemented the latest data science algorithms. To meet the challenges that are emerging, powerful new data science tools have been developed (e.g. Python, R, H2O, Weka, TensorFlow, Spark, Flink, BigML or KNIME). Companies that want to use these tools face a trade-off between the cost of implementation and the development gains they deliver in return. Today, most advanced algorithms are open source and available on multiple platforms and in multiple programming languages, which helps to reduce the development costs that each company has to bear.

Still, one of the main dangers for these development teams is not knowing the state of the art in advanced algorithms or which problems those algorithms can address. To help mitigate this problem, this paper presents a review of algorithms. The review gives a perspective on which algorithms are being developed and which problem areas they can address. The development of more powerful data science algorithms also makes it possible to tackle more complex and interesting problems. However, one limitation of the review is that it omits some of the many algorithms that are currently being produced and that the community frequently selects as the best performers on many benchmark datasets.

Keywords

AI; advanced data engineering; intelligent transportation systems; traffic management; machine learning; autonomous vehicles; real-time analytics; predictive modeling; IoT in transportation; big data; smart cities; transportation sustainability; adaptive traffic control; edge computing; neural networks; reinforcement learning; data pipelines; cloud computing; data integration; mobility solutions; infrastructure optimization; dynamic routing; traffic signal control; real-time traffic prediction; traffic congestion reduction; transportation safety; urban mobility; scalable architectures; multimodal transportation; vehicle-to-everything (V2X); connected vehicles; autonomous driving; data interoperability; geospatial analysis; transportation efficiency; GPS systems; traffic flow optimization; sensor fusion; vehicle navigation; data preprocessing; anomaly detection; real-time data ingestion; AI in urban planning; cybersecurity in ITS; data-driven decision making; graph-based algorithms; digital twins; transportation networks; road safety; transportation planning; intelligent mobility; energy-efficient transportation; sustainable urban systems; environmental impact reduction; AI in logistics; data security in transportation; fleet management; connected infrastructure; event stream processing.

Author details
Vishwanadham Mandala
Service Delivery Lead, Cummins Inc
Corresponding Author