Since the emergence of Blockchain as Bitcoin, its development has progressed rapidly and attracted the attention of various researchers in academia and industry. Blockchain technology is becoming an increasingly secure and effective way to share information in various industries, including finance, supply chain management (SCM), and the Internet of Things (IoT). The increase in the number of Blockchain users demands malicious and non-malicious transactions to maintain the trust in Blockchain. This research aims to develop a machine learning (ML) based model for classifying blockchain transactions into risky or non-risky ones. The model comprises four feature selection approaches, including Correlation-based Feature Selection (CFS), Recursive Feature Elimination (RFE), Random Forest (RF), and Information Gain (IG). Then, two ensemble feature selection methods, known as rank averaging and rank aggregation, are applied to combine the features selected from the initial feature selection methods. Various ML classification algorithms are applied to the selected features from two ensemble feature selection algorithms as the base learners to make initial predictions. Finally, three different ensemble base classifiers, including hard voting, soft voting, and weighted averaging, are applied to these initial predictions to make the final prediction. Three blockchain transactional datasets are considered for evaluating the proposed ensemble-based model. The empirical analysis of the reported work shows that the maximum accuracy obtained using the Rank Averaging ensemble feature selection technique is 99.24%, whereas the maximum accuracy using the Rank Aggregation ensemble feature technique is 98.73%.
Keywords: Blockchain; Ensemble feature selection; Machine learning; Rank aggregation; Rank averaging.
© 2025. The Author(s).