جستجوی مقالات مرتبط با کلیدواژه « Clustering » در نشریات گروه « صنایع »

تکرار جستجوی کلیدواژه « Clustering » در نشریات گروه « فنی و مهندسی »

انتخاب همه

توسعه ی الگوریتم بازشناسی اعداد دست نویس فارسی، بر پایه ی الگوریتم های طبقه بندی شبکه ی عصبی چندلایه و احتمالاتی، به کمک مراکز خوشه

علی میری، مجید خدمتی*

مجله مهندسی صنایع و مدیریت شریف، سال سی و نهم شماره 1 (بهار و تابستان 1402)، صص 23 -34

در این پژوهش تلاش شده است تا با ارایه ی الگوریتمی بهبودیافته و مبتنی بر خوشه بندی، بازشناسی اعداد دست نویس فارسی با دقت قابل توجهی صورت پذیرد. بر این اساس، آموزش و بازشناسی الگوها به کمک شبکه ی عصبی احتمالاتی و چندلایه ی پرسپترون میسر شده است، به این صورت که پس از استخراج دو دسته ویژگی مکان مشخصه و ناحیه یی از داده های آموزشی، داده های هریک از کلاس های دهگانه بر اساس هر ویژگی با استفاده از روش های پیوند کامل، P A M و F C M خوشه بندی شده و کلاس های دهگانه ی جدید حاصل از خوشه بندی، توسط یکی از دو الگوریتم طبقه بندی کننده آموزش می بینند. تعداد بهینه خوشه های هر کلاس با استفاده از الگوریتم بهینه سازی جست وجوی ممنوعه با تابع برازندگی نرخ بازشناسی تعیین می شود. میزان دقت الگوریتم در نهایت با استفاده از داده های آزمایش مورد سنجش قرار می گیرد و با توجه به نتایج ملاحظه می شود که الگوریتم پیشنهادی، بازشناسی اعداد دست نویس فارسی را با دقت بالایی انجام می دهد.

کلید واژگان: خوشه بندی, شبکه ی عصبی چندلایه, شبکه ی عصبی احتمالاتی, بازشناسی, جست وجوی ممنوعه}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

DEVELOPMENT OF AN ALGORITHM FOR PERSIAN HANDWRITING DIGITS RECOGNITION BASED ON MLP AND PNN CLASSIFIERS AND USING CLUSTER CENTERS

A. Miri, M. Khedmati *

Industrial Engineering & Management Sharif, Volume:39 Issue: 1, 2023, PP 23 -34

Pattern recognition is a branch of machine learning that recognizes the patterns and regularities in a set of data, and digit recognition is considered one of the pattern recognition categories. Due to the similarities between some digits in each language, especially in Persian, different algorithms have been developed to recognize the handwriting digits with the least error and in the shortest time complexity. One of the most commonly used methods in data classi cation is the neural network algorithm. While neural networks have been used in the literature for handwriting digits recognition, the combination of clustering approaches and neural network classi ers has not been considered for this problem. Accordingly, this paper proposes an algorithm based on the combination of clustering approaches and neural network classi ers to recognize the Persian handwritten digits accurately. This algorithm performs pattern training and recognition based on Probabilistic Neural Networks (PNN) and multilayer perceptron (MLP) neural networks. In this regard, after extracting the characteristic loci feature and zoning from each image in the training database, the data of each of the ten classes has been clustered using linkage, Partition Around Medoids (PAM), and Fuzzy C-Means (FCM) methods based on the extracted features. Then, the new ten classes resulting from the clustering algorithm are taught by one of the two classi ers, including MLP and PNN. In order to determine the optimal number of clusters in each class, the Tabu search optimization algorithm, one of the most accurate meta-heuristic optimization algorithms, is used. The performance of the proposed algorithms is evaluated and compared with existing algorithms based on the HODA dataset. Based on the results, the proposed algorithm accurately recognizes the Persian handwritten digits. In addition, the proposed method performs more accurately and much faster than most competing algorithms.

Keywords: Clustering, MLP, PNN, digit recognition, tabu search}

Abstract View Paper Research/Original Article Original: Persian
ارایه یک مدل ترکیبی به منظور تحلیل تطبیقی الگوریتم های خوشه بندی داده های مالی

مجتبی موحدی، مهدی همایونفر*، مهدی فدایی، منصور صوفی

نشریه تصمیم گیری و تحقیق در عملیات، سال هشتم شماره 2 (تابستان 1402)، صص 507 -526

هدف

الگوریتم های خوشه بندی، ابزارهای مفیدی برای درک ساختار داده ها و طبقه بندی آن ها در مجموعه داده های مختلف می باشند. با توجه به اهمیت به کارگیری این الگوریتم ها در تحلیل داده های بازارهای مالی که از حجم و گستردگی بالایی برخوردارند، این پژوهش به منظور انتخاب بهترین الگوریتم خوشه بندی برای خوشه بندی شرکت های حاضر در بورس اوراق بهادار تهران در حوزه مالی از الگوریتم های خوشه بندی مختلف استفاده نموده و به ارزیابی اعتبار این الگوریتم ها و انتخاب بهترین الگوریتم پرداخته است.

روش شناسی پژوهش:

این پژوهش از نظر هدف، کاربردی و از نظر روش اجرا توصیفی و از نوع کمی (مدل سازی ریاضی) است. جامعه آماری تحقیق شامل 403 شرکت حاضر در بورس اوراق بهادار تهران در سال 98 است که عملکرد آن ها بر اساس چهار معیار مالی ارزیابی شده است.

یافته ها

پس از خوشه بندی شرکت های مورد بررسی توسط پنج الگوریتم خوشه بندی K-Means، EM، COBWEB، الگوریتم مبتنی بر چگالی و روش وارد، از هفت شاخص RS، DB، دان، SD، خلوص، آنتروپی و زمان برای ارزیابی الگوریتم های خوشه بندی استفاده گردید. در نهایت، عملکرد نهایی الگوریتم های به کار رفته بر اساس روش های تاپسیس، ویکور و تحلیل پوششی داده ها مورد تجزیه وتحلیل قرار گرفت. بر اساس نتایج، روش K-Means از عملکرد بهتری در خوشه بندی شرکت ها بر اساس مجموعه داده های مالی برخوردار است.

اصالت/ارزش افزوده علمی:

از آن جایی که هیچ الگوریتم خوشه بندی نمی تواند بهترین عملکرد را در تمام اندازه گیری ها برای هر مجموعه داده داشته باشد، این پژوهش ضمن به کارگیری ترکیبی از معیارهای چند گانه به منظور تجزیه وتحلیل الگوریتم های خوشه بندی داده های مربوط به حوزه ارزیابی عملکرد مالی شرکت ها، به ارایه پیشنهاداتی پرداخته و نتایج این پژوهش برای سرمایه گذاران حوزه مالی کاربرد موثر داشته که منجر به انتخاب بهینه سبد سرمایه گذاری می شود.

کلید واژگان: ارزیابی عملکرد مالی, تصمیم گیری چند معیاره, خوشه بندی}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Developing a hybrid model for comparative analysis of financial data clustering algorithms

Mojtaba Movahedi, Mahdi Homayounfar *, Mehdi Fadaei, Mansour Soufi

Journal of Decisions and Operations Research, Volume:8 Issue: 2, 2023, PP 507 -526

Purpose

Clustering algorithms are useful tools for understanding data structure and classifying them into different data sets. Due to the importance of using these algorithms in analyzing financial market data that have a high volume and scope, this study in order to select the best clustering algorithm for clustering companies listed on the Tehran Stock Exchange in the field of finance from It has used different clustering algorithms and evaluated the validity of these algorithms and selected the best algorithm.

Methodology

This research is applied in terms of purpose and descriptive in terms of implementation method and is of quantitative type (mathematical modeling). The statistical population of the research includes 403 companies listed on the Tehran Stock Exchange in 2019, whose performance has been evaluated based on four financial criteria.

Findings

After clustering the surveyed companies by five clustering algorithms, namely K-means, EM, COBWEB, density-based algorithm and ward method, seven indicators RS, DB, Dun, SD, Purity, Entropy and Time were used to evaluate the algorithms. Finally, the total performance of the algorithms was analyzed based on TOPSIS, VICOR and DEA methods. Based on the results, K-means has a better performance in clustering based on the financial data sets.

Originality/Value:

Since no clustering algorithm can have the best performance in all measurements for each data set, this study uses a combination of multiple criteria to analyze data clustering algorithms related to the field of financial performance appraisal. Companies have provided suggestions and the results of this study have been used effectively for investors in the field of finance, which leads to the optimal choice of investment portfolio.

Keywords: Clustering, Multi-criteria decision making, Financial Performance Evaluation}

Abstract View Paper Research/Original Article Original: Persian
An interpretable machine learning Framework for customer churn Prediction: A Case Study in the Telecommunications Industry

MohammadJavad Jafari, M. J. Tarokh *, Paria Soleimani

Journal of Industrial Engineering and Management Studies, Volume:10 Issue: 1, Winter-Spring 2023, PP 141 -157

Customer churn prediction has been gaining significant attention due to the increasing competition among mobile service providers. Machine learning algorithms are commonly used to predict churn; however, their performance can still be improved due to the complexity of customer data structure. Additionally, the lack of interpretability in their results leads to a lack of trust among managers. In this study, a step-by-step framework consisting of three layers is proposed to predict customer churn with high interpretability. The first layer utilizes data preprocessing techniques, the second layer proposes a novel classification model based on supervised and unsupervised algorithms, and the third layer uses evaluation criteria to improve interpretability. The proposed model outperforms existing models in both predictive and descriptive scores. The novelties of this paper lie in proposing a hybrid machine learning model for customer churn prediction and evaluating its interpretability using extracted indicators. Results demonstrate the superiority of clustered dataset versions of models over non-clustered versions, with KNN achieving a recall score of almost 99% for the first layer and the cluster decision tree achieving a 96% recall score for the second layer. Additionally, parameter sensitivity and stability are found to be effective interpretability evaluation metrics.

Keywords: Machine Learning, customer churn prediction, Interpretability, Clustering, Classification}

Abstract View Paper Research/Original Article Original: English
A novel two-level clustering algorithm for time series group forecasting.

Ali Ghorbanian, Hamideh Razavi *

Journal of Industrial and Systems Engineering, Volume:15 Issue: 1, Winter 2023, PP 54 -68

Parametric models are considered the widespread methods for time series forecasting. Non-parametric or machine learning methods have significantly replaced statistical methods in recent years. In this study, we develop a novel two-level clustering algorithm to forecast short-length time series datasets using a multi-step approach, including clustering, sliding window, and MLP neural network. In first-level clustering, the time series dataset in the training part is clustered. Then, we made a long time series by concatenating the existing time series in each cluster in the first level. After that, using a sliding window, every long-time series created in the previous step is restructured to equal-size sub-series and clustered in the second level. Applying an MLP network, a model has been fitted to final clusters. Finally, the test data distance is calculated with the center of the final cluster, selecting the nearest distance, and using the fitted model in that cluster, the final forecasting is done. Using the WAPE index, we compare the one-level clustering algorithm in the literature regarding the mean of answers and the best answer in a ten-time run. The results reveal that the algorithm could increase the WAPE index value in terms of the mean and the best solution by 8.78% and 5.24%, respectively. Also, comparing the standard deviation of different runs shows that the proposed algorithm could be further stabilized with a 3.24 decline in this index. This novel study proposed a two-level clustering for forecasting short-length time series datasets, improving the accuracy and stability of time series forecasting.

Keywords: time series, Clustering, Forecasting, sliding window, Neural Network}

Abstract View Paper Research/Original Article Original: English
الگوریتم ترکیبی KMPCA برای انتخاب فناوری با استفاده از خوشه بندی و تحلیل مولفه های اصلی

عباس سرافرازی*

مجله توسعه تکنولوژی صنعتی، پیاپی 51 (بهار 1402)، صص 85 -100

انتخاب فناوری مناسب، مسیله مهمی است که بنگاه های تولیدی و صنعتی با آن مواجه هستند. این درحالی است که، دسترسی به فناوری های جدید، مجموعه انتخاب را وسیع کرده است بطوریکه حل مسیله انتخاب فناوری با وجود معیارهای تصمیم متعدد، بیش از پیش مشکل و پیچیده شده است. از طرفی فناوری مناسب می تواند مزایای رقابتی قابل توجه ای را برای یک شرکت در یک محیط پیچیده کسب و کار ایجاد نماید. تاکنون روش های مختلفی جهت حل مسیله انتخاب فناوری ارایه گردیده است که هر یک دارای مزایا و معایبی هستند ولی هیج یک از روش های پیشنهادی به طور واحد کلیه قابلیت های لازم را ندارند. در این مقاله با بکارگیری روش تحلیل مولف های اصلی، روش ترکیبی خوشه بندی و تحلیل مولف های اصلی به همراه تیوری فازی، الگوریتم ترکیبی KMPCA در حل مسیله انتخاب فناوری توسعه داده شده است. تعداد متغیرهای انتخاب فناوری از 6 به 14 متغیر با پوشش کامل تری از ابعاد تصمیم ارتقاء یافت و داده ها از 49 فناوری رایج صنعت سنگ ایران جمع آوری و در مدل آزمون گردید. نتایج این تحقیق ضمن بهبود حل، کاهش ابعاد مسیله و کاهش روابط چند همخطی میان داده ها را در فرآیند انتخاب فناوری نشان می دهد.

کلید واژگان: خوشه بندی, K-Means, تحلیل مولفه های اصلی, PCA, KMPCA, انتخاب فناوری}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

The K MPCA hybrid algorithm to selection the technology using clustering and principal component analysis

Abbas Sarafrazi *

Journal of Industrial Technology Development, Volume:21 Issue: 51, 2023, PP 85 -100

Choosing the right technology is an important issue faced by manufacturing and industrial companies. This is while the access to new technologies has widened the selection set so that solving the problem of technology selection has become more difficult and complicated despite multiple decision criteria. On the other hand, appropriate technology can create significant competitive advantages for a company in a complex business environment. So far, various methods have been presented to solve the problem of technology selection, each of which has advantages and disadvantages, but none of the proposed methods have all the necessary capabilities. In this article, by using the method of principal components analysis, the combined method of clustering and analysis of principal components along with fuzzy theory, the combined KMPCA algorithm has been developed in solving the technology selection problem. The number of technology selection variables was increased from 6 to 14 variables with a more complete coverage of decision dimensions, and data from 49 currently technologies of Iran's stone industry were collected and tested in the model. The results of this research, while improving the solution, show the reduction of the dimensions of the problem and the reduction of multi-collinear relationships between the data in the technology selection process.

Keywords: technology selection, clustering, K-Means, principal component analysis, PCA, KMPCA}

Abstract View Paper Research/Original Article Original: Persian
تسهیل فروش متقاطع خدمات ارزش افزوده تلفن همراه با استفاده از تکنیک های داده کاوی

حمیدرضا عاطفی*، جلال رضایی نور

نشریه مدیریت مهندسی و رایانش نرم، سال نهم شماره 1 (پیاپی 16، بهار و تابستان 1402)، صص 171 -193

کسب مزیت رقابتی برای اپراتورهای تلفن همراه بسیار با اهمیت است. خدمات ارزش افزوده تلفن همراه یکی از نوآوری هایی است که اپراتور ها از آن برای تنوع بخشیدن به کسب و کار خود استفاده می کنند. فروش متقاطع برای اپراتورهای تلفن همراه، برای گسترش درآمد و سود بسیار مهم است. زیرا اپراتورها هزینه های جانبی کمتری را در مقایسه با جذب مشتریان جدید متحمل خواهند شد. اما شناخت مشتریان بالقوه خرید خدمات ارایه شده توسط اپراتورها، برای آنها ساده نیست. در این مقاله، برای تسهیل فروش متقاطع خدمات ارزش افزوده تلفن همراه تلاش شده است. داده های استفاده شده در این تحقیق اطلاعات مربوط به خرید های گذشته مشتریان شرکت همراه اول از خدمات ارزش افزوده تلفن همراه است. در راهکار ارایه شده، به زیر ساخت های ایجاد پروفایل فروش متقاطع مشتریان پرداخته شده است. در این راهکار، پس از تعیین دسته بهینه ای از مشتریان با استفاده از خوشه بندی آنها، برای کشف قوانین میان خدمات استفاده شده توسط مشتریان، تلاش شده است. با ساخت این پروفایل می توان جامعه هدفی برای فروش متقاطع هر یک از خدمات بدست آورد.

کلید واژگان: خدمات ارزش افزوده, خوشه بندی, فروش متقاطع, قوانین وابستگی}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Facilitate cross-selling of value-added mobile services using data mining

Hamidreza Atefi *, Jalal Rezaeenour

Engineering Management and Soft Computing, Volume:9 Issue: 1, 2023, PP 171 -193

Gaining a competitive advantage is very important for mobile operators. Mobile value-added services are one of the innovations that operators use to diversify their business. Cross-selling is crucial for mobile operators to generate revenue and profits. Because operators will incur lower ancillary costs compared to attracting new customers. But it is not easy for them to identify potential customers who buy the services provided by operators. In this article, an attempt has been made to facilitate the cross-selling of mobile value-added services. The data used in this research is information about the past purchases of the customers of HamrahAval Company from the value-added mobile services. In the proposed solution, the infrastructure for creating cross-selling customer profiles is discussed. In this solution, after determining the optimal category of customers using their clustering, an attempt has been made to discover the rules between the services used by customers. By creating this profile, a target community can be achieved for the cross-selling of each service.

Keywords: Association rule, Clustering, Cross sale, Value-added services}

Abstract View Paper Research/Original Article Original: Persian
خوشه بندی شبکه حسگر بی سیم مبتنی بر الگوریتم انتشار برچسب

منیره سادات یونس پور*، مرتضی رموزی

نشریه مدیریت مهندسی و رایانش نرم، سال هشتم شماره 2 (پیاپی 15، پاییز و زمستان 1401)، صص 16 -28

شبکه حسگر بی سیم یک فناوری در حال رشد است.در شبکه های سنسور بی سیم کارایی شبکه معمولا تحت تاثیر محدودیت انرژی است. در این مقاله روش پیشنهادی بر مبانی الگوریتم انتشار برچسب برای غلبه بر این محدودیت ارایه شده است. ابتدا از سنسورها گراف تشکیل شده و در مرحله بعدی وزن دهی به یال های این گراف بر اساس چهار معیار شباهت انجام می شود. سپس برای هر گره مرکزیت و برچسب اولیه به دست می آید و درنهایت با به روزرسانی برچسب ها، گره هایی که برچسب یکسان دارند در یک خوشه قرار می گیرند. نتایج حاصل شده از روش پیشنهادی با معیارهای تعداد گره های زنده و میانگین انرژی گره های زنده با روش لیچ (leach) مقایسه شده است. که نتایج نشان دهنده این است که درروش پیشنهادی محل قرار گرفتن سنسورها و تنظیم مقدار آستانه برای تشکیل گراف از سنسورها جز متغیرهای اساسی است و مقایسه نشان دهنده برتری روش پیشنهادی نسبت به روش لیچ است.

کلید واژگان: شبکه حسگر بی سیم, خوشه بندی, انتشار برچسب}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Clustering Wireless Sensor Networks Based on Label Propagation Algorithm1

Monireh Sadat Younespour *, Morteza Romoozi

Engineering Management and Soft Computing, Volume:8 Issue: 2, 2022, PP 16 -28

Wireless sensor network is a growing technology. In wireless sensor networks, performance is usually affected by energy constraints. In this paper, a method is proposed based on the label propagation algorithm for this limitation. At first, the sensors are composed of a graph In the next stage weighing the edges of this graph is based on four similarity measure Then for each node, the centrality and the initial label are obtained And finally, by updating the lebel,nodes with the same label are placed in a cluster The results of the proposed method have been compared with the number of live nodes and the mean energy of live nodes measures by the leach method The results indicate that in the proposed method of positioning the sensors and setting the threshold value for the formation of the graph from the sensors are only fundamental variables And the comparison shows that the proposed method is superior to the leach method

Keywords: Wireless sensor network, Clustering, label propagation}

Abstract View Paper Research/Original Article Original: Persian
ارائه روشی به منظور کاهش مصرف انرژی در شبکه های حسگر بی سیم با استفاده از الگوریتم خوشه بندی DBSCAN

رضا مولایی فرد*

نشریه مدیریت مهندسی و رایانش نرم، سال هشتم شماره 2 (پیاپی 15، پاییز و زمستان 1401)، صص 273 -295

در این تحقیق، به ارزیابی روش های خوشه بندی انرژی کارآمد در شبکه های حسگر بی سیم می پردازیم. نتایج حاصل از این پژوهش حاکی از آن است که خوشه بندی توسط DBSCAN، امتیاز کارایی بالاتری نسبت به سایر روش های خوشه بندی به دست می آورد. به طوری که DBSCAN امتیاز کارایی 99٪ را به دست آورد اما الگوریتم K-Means، امتیاز کارایی 76٪ را به دست آورد. همچنین انرژی باقیمانده در شبکه پس از اتمام شبیه سازی در مسیریابی با پروتکل جدید حدود 11٪ بیشتر از مسیریابی EEHC و حدود 9٪ بیشتر از مسیریابی با پروتکل LCA است. اگر طول عمر شبکه را در زمان خاموش شدن اولین گره در شبکه در نظر بگیریم در پروتکل جدید اولین گره 6 ثانیه دیرتر از پروتکل EEHC و 12 ثانیه دیرتر از پروتکل LCA خاموش می شود. این بدین معناست که به طور میانگین حدود 10٪ طول عمر شبکه با پروتکل جدید افزایش یافته است. پروتکل I-LEACH راندمان انرژی و طول عمر را باکار بیشتر در همان ساختارها در مقایسه با پروتکل معمولی LEACH بهبود می بخشد.

کلید واژگان: پروتکل I-LEACH, خوشه بندی, شبکه حسگر بی سیم, کاهش انرژی}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

provide a method to reduce energy consumption in wireless sensor networks using DBSCAN clustering algorithm

Reza Molaee Fard *

Engineering Management and Soft Computing, Volume:8 Issue: 2, 2022, PP 273 -295

In this research, we evaluate energy efficient clustering methods in wireless sensor networks. The results of this study indicate that clustering by DBSCAN has a higher efficiency score than other clustering methods. The DBSCAN scored 99%, but the K-Means algorithm scored 76%. Also, the energy remaining in the network after the simulation is completed in routing with the new protocol is about 11% more than routing with EEHC and about 9% more than routing with LCA protocol. If we consider the lifespan of the network when the first node in the network is turned off, in the new protocol, the first node shuts down 6 seconds later than the EEHC protocol and 12 seconds later than the LCA protocol. This means that on average, about 10% of network life has been increased with the new protocol. The I-LEACH protocol improves energy efficiency and longevity by working more in the same structures than the conventional LEACH protocol.

Keywords: Wireless sensor network, Clustering, energy reduction, I-LEACH algorithm}

Abstract View Paper Research/Original Article Original: Persian
Clustering of fuzzy data sets based on particle swarm optimization with fuzzy cluster centers

Seyed Hamid Zahiri*, Najme Ghanbari, Hadi Shahraki

International Journal of Industrial Engineering and Productional Research, Volume:33 Issue: 2, Jun 2022, P 13

In current study, a particle swarm clustering method is suggested for clustering triangular fuzzy data. This clustering method can find fuzzy cluster centers in the proposed method, where fuzzy cluster centers contain more points from the corresponding cluster, the higher clustering accuracy. Also, triangular fuzzy numbers are utilized to demonstrate uncertain data. To compare triangular fuzzy numbers, a similarity criterion based on the intersection region of the fuzzy numbers is used. The performance of the suggested clustering method has been experimented on both benchmark and artificial datasets. These datasets are used in the fuzzy form. The experiential results represent that the suggested clustering method with fuzzy cluster centers can cluster triangular fuzzy datasets like other standard uncertain data clustering methods. Experimental results demonstrate that, in almost all datasets, the proposed clustering method provides better results in accuracy when compared to Uncertain K-Means and Uncertain K-medoids algorithms.

Keywords: Clustering, Particle swarm clustering method, Uncertain data, Triangular fuzzy data, Fuzzy cluster centers, Similarity value}

Abstract View Paper Research/Original Article Original: English
مقایسه ی عملکرد روش های اندازه گیری شباهت طولانی ترین زیردنباله ی مشترک و چرخش زمانی پویا در داده کاوی سری های زمانی

غلامرضا سلیمانی، مسعود عابسی *

مجله مهندسی صنایع و مدیریت شریف، سال سی و هفتم شماره 2 (پاییز و زمستان 1400)، صص 79 -90

تکنیک‌های داده‌کاوی به‌طور خاص برای داده‌های ثابت طراحی شده‌اند. لذا به‌کارگیری آنها برای داده‌های سری زمانی نیازمند اعمال تغییراتی(روش اندازه‌گیری شباهت) است. براساس تحقیقات اخیر، روش‌های طولانی‌ترین زیردنباله‌ی مشترک و چرخش زمانی پویا، از پرکاربردترین و کاراترین این روش‌ها محسوب می‌شود. در این تحقیق، قصد داریم تا عملکرد این روش‌ها را در تکنیک‌های نزدیک‌ترین همسایگی و خوشه‌بندی کامدوید مورد ارزیابی و مقایسه قرار داده تا بتوان از آنها با دقت بهتری در این تکنیک‌ها و در مسایلی نظیر قسمت‌بندی مشتریان، زمان‌بندی کارگاه و... استفاده کرد. به همین منظور از 63 مجموعه داده سری زمانی از بانک اطلاعاتی UCR، استفاده می‌شود. نتایج نشان می‌دهد که تاثیرآنها در دقت تشخیص درست دسته‌ی سری زمانی و دقت خوشه‌بندی، به‌طور معناداری تفاوت دارد، ولی تاثیر آنها در تعیین تعداد خوشه و نماینده‌ی خوشه، تفاوت معناداری ندارد.

کلید واژگان: داده کاوی سری های زمانی, خوشه بندی, نزدیک ترین همسایگی, طولانی ترین زیردنباله ی مشترک, چرخش زمانی پویا}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

PERFORMANCE COMPARISON OF THE LONGEST COMMON SUBSEQUENCE AND DYNAMIC TIME WARPING IN TIME SERIES DATA MINING

Gh. Soleimani, M. Abessi*

Industrial Engineering & Management Sharif, Volume:37 Issue: 2, 2022, PP 79 -90

Today, the use of data mining techniques such as classification, clustering, discover repetitive pattern and discover outliers in different domains including production, medicine, social, meteorology, stock exchange, sales, customer service and other areas are increasing. Data mining techniques are specifically designed for static data. Therefore, their use for time series data requires some modifications to their respective algorithms. One of these changes is the selection of the appropriate similarity measurement method, because similarity measurement methods are used in all data mining techniques. Therefore, in this research, we will evaluate and compare the effect of two commonly used and efficient methods of time series similarity measurement in data mining. This evaluation is done in relation to the effectiveness of these methods in achieving better results. These methods are the Longest Common Sub Sequence (LCSS) method and the Dynamic time Warping (DTW) method. The main purpose of this research is to compare the performance of these methods in time series data mining. The data mining techniques that used in this research are the nearest-neighbor technique and k-medoids clustering algorithm. The performance evaluation process is described in the text. This process uses the nearest-neighbor technique to calculate the accuracy of detection of right time series class, and uses the k-medoids clustering technique to calculate the clustering accuracy, the ability to correctly determine the number of clusters, and the ability to determine the better cluster representative. For this purpose, we use 63 time series data sets by random from a world-renowned database that named UCR collection. The results show that the effect of LCSS method is significantly better than the effect of DTW method on the correct detection accuracy of time series class and clustering accuracy by 99% and 92.5% confidence, respectively, but there is no significant difference between them in terms of their effect in determining the number of clusters and cluster representatives. The results of this research help to use these methods in appropriate data mining techniques in issues such as customer segmentation, workshop scheduling and the like more accurately.

Keywords: Time series data mining, clustering, nearest neighbor, longest common subsequence, dynamic time warping}

Abstract View Paper Research/Original Article Original: Persian
CUSTOMER CLUSTERING BASED ON FACTORS OF CUSTOMER LIFETIME VALUE WITH DATA MINING TECHNIQUE

Elaheh Bakhshizadeh, Hossein Aliasghari, Rassoul Noorossana*, Rouzbeh Ghousi

International Journal of Industrial Engineering and Productional Research, Volume:33 Issue: 1, Mar 2022, P 3

Organizations have used Customer Lifetime Value (CLV) as an appropriate pattern to classify their customers. Data mining techniques have enabled organizations to analyze their customers’ behaviors more quantitatively. This research has been carried out to cluster customers based on factors of CLV model including length, recency, frequency, and monetary (LRFM) through data mining. Based on LRFM, transaction data of 1865 customers in a software company has been analyzed through Crisp-DM method and the research roadmap. Four CLV factors have been developed based on feature selection algorithm. They also have been prepared for clustering using quintile method. To determine the optimum number of clusters, silhouette and SSE indexes have been evaluated. Additionally, k-means algorithm has been applied to cluster the customers. Then, CLV amounts have been evaluated and the clusters have been ranked. The results show that customers have been clustered in 4 groups namely high value loyal customers, uncertain lost customers, uncertain new customers, and high consumption cost customers. The first cluster customers with the highest number and the highest CLV are the most valuable customers and the fourth, third, and second cluster customers are in the second, third, and fourth positions respectively. The attributes of customers in each cluster have been analyzed and the marketing strategies have been proposed for each group.

Keywords: Customer lifetime value, LRFM model, Data mining, Clustering}

Abstract View Paper Research/Original Article Original: English
خوشه بندی شبکه حسگر بی سیم مبتنی بر الگوریتم انتشار برچسب

منیره سادات یونس پور*، مرتضی رموزی

نشریه مدیریت مهندسی و رایانش نرم، سال هفتم شماره 2 (پیاپی 13، پاییز و زمستان 1400)، صص 31 -49

شبکه حسگر بی سیم یک فناوری در حال رشد است.در شبکه های سنسور بی سیم کارایی شبکه معمولا تحت تاثیر محدودیت انرژی است. در این مقاله روش پیشنهادی بر مبانی الگوریتم انتشار برچسب برای غلبه بر این محدودیت ارایه شده است. ابتدا از سنسورها گراف تشکیل شده و در مرحله بعدی وزن دهی به یال های این گراف بر اساس چهار معیار شباهت انجام می شود. سپس برای هر گره مرکزیت و برچسب اولیه به دست می آید و درنهایت با به روزرسانی برچسب ها، گره هایی که برچسب یکسان دارند در یک خوشه قرار می گیرند. نتایج حاصل شده از روش پیشنهادی با معیارهای تعداد گره های زنده و میانگین انرژی گره های زنده با روش لیچ (leach) مقایسه شده است. که نتایج نشان دهنده این است که درروش پیشنهادی محل قرار گرفتن سنسورها و تنظیم مقدار آستانه برای تشکیل گراف از سنسورها جز متغیرهای اساسی است و مقایسه نشان دهنده برتری روش پیشنهادی نسبت به روش لیچ است.

کلید واژگان: شبکه حسگر بی سیم, خوشه بندی, انتشار برچسب}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

wireless sensor network clustering based on label propagation algorithm

Monireh Sadat Younespour *, Morteza Romoozi

Engineering Management and Soft Computing, Volume:7 Issue: 2, 2022, PP 31 -49

Wireless sensor network is a growing technology. In wireless sensor networks, performance is usually affected by energy constraints. In this paper, a method is proposed based on the label propagation algorithm for this limitation. At first, the sensors are composed of a graph In the next stage weighing the edges of this graph is based on four similarity measure Then for each node, the centrality and the initial label are obtained And finally, by updating the lebel,nodes with the same label are placed in a cluster The results of the proposed method have been compared with the number of live nodes and the mean energy of live nodes measures by the leach method The results indicate that in the proposed method of positioning the sensors and setting the threshold value for the formation of the graph from the sensors are only fundamental variables And the comparison shows that the proposed method is superior to the leach method

Keywords: Wireless sensor network, Clustering, label propagation}

Abstract View Paper Research/Original Article Original: Persian
ارائه روشی به منظور کاهش مصرف انرژی در شبکه های حسگر بی سیم با استفاده از الگوریتم خوشه بندی DBSCAN

رضا مولایی فرد*

نشریه مدیریت مهندسی و رایانش نرم، سال هفتم شماره 2 (پیاپی 13، پاییز و زمستان 1400)، صص 273 -295

در این تحقیق، به ارزیابی روش های خوشه بندی انرژی کارآمد در شبکه های حسگر بی سیم می پردازیم. نتایج حاصل از این پژوهش حاکی از آن است که خوشه بندی توسط DBSCAN، امتیاز کارایی بالاتری نسبت به سایر روش های خوشه بندی به دست می آورد. به طوری که DBSCAN امتیاز کارایی 99٪ را به دست آورد اما الگوریتم K-Means، امتیاز کارایی 76٪ را به دست آورد. همچنین انرژی باقیمانده در شبکه پس از اتمام شبیه سازی در مسیریابی با پروتکل جدید حدود 11٪ بیشتر از مسیریابی EEHC و حدود 9٪ بیشتر از مسیریابی با پروتکل LCA است. اگر طول عمر شبکه را در زمان خاموش شدن اولین گره در شبکه در نظر بگیریم در پروتکل جدید اولین گره 6 ثانیه دیرتر از پروتکل EEHC و 12 ثانیه دیرتر از پروتکل LCA خاموش می شود. این بدین معناست که به طور میانگین حدود 10٪ طول عمر شبکه با پروتکل جدید افزایش یافته است. پروتکل I-LEACH راندمان انرژی و طول عمر را باکار بیشتر در همان ساختارها در مقایسه با پروتکل معمولی LEACH بهبود می بخشد.

کلید واژگان: پروتکل I-LEACH, خوشه بندی, شبکه حسگر بی سیم, کاهش انرژی}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

provide a method to reduce energy consumption in wireless sensor networks using DBSCAN clustering algorithm

Reza Molaee Fard *

Engineering Management and Soft Computing, Volume:7 Issue: 2, 2022, PP 273 -295

In this research, a new method is presented to improve the recommendation systems in the field of health tourism, which can make accurate predictions by using participatory filtering and by using the points that previous tourists have given to places and health professionals in our country. For the use of tourists. According to the research, data clustering using DBSCAN algorithm obtained 99% efficiency score, which is the highest efficiency score among the existing algorithms. Also, SVM method has 95% score in accuracy section and 99% score in call section. Which shows the high accuracy of predicting the results and the proposed method in general up to 80% can correctly identify the places needed by the tourist and suggest the appropriate place to a large extent correctly

Keywords: Wireless sensor network, Clustering, energy reduction, I-LEACH algorithm}

Abstract View Paper Research/Original Article Original: Persian
Clustering of basketball players using self-organizing map neural networks

Soroush Babaee Khobdeh, MohammadReza Yamaghani *, Siavash Khodaparast Sareshkeh

Journal of Applied Research on Industrial Engineering, Volume:8 Issue: 4, Autumn 2021, PP 412 -428

Clustering players based on their abilities, a new perspective and an important opportunity to meet needs that in the light of traditional talent identification and player science, which is held periodically and there is not enough time for them to appear. Early recognition of these abilities is a factor influencing the success of sports teams. Artificial Neural Network (ANN) is a new method of modelling and prediction. The aim of this study was to cluster basketball players based on their individual abilities. For this purpose, Self-Organizing Map (SOM) Neural Networks (NNs) were used. The data set used by 3000 NBA players for 2011 until 2018 is from the Basketball-Reference[1] site. Each player is assigned 30 attributes to reduce them using the Principal Component Analysis (PCA) method and the features for each player were reduced to 12 samples. In order to implement a SOM of features and functions in MATLAB software 65% of the data were used as the network training phase and the remaining 35% were used to the test phase. 12 players’ features as network input and output 9 clusters resulting from the combination of features. After simulation using SOM, accuracy parameter with the help of this system were obtained above 95%. The result of the study showed that the performance of the SOM in clustering basketball players was higher than the K-Means algorithm. The network implemented in this article has a faster speed in the training process and generalizability than similar cases.

Keywords: Basketball, Clustering, k-means, Self-Organizing Map (SOM), Neural Networks (NNs), discriminant Analysis, PCA}

Abstract View Paper Research/Original Article Original: English
Modeling Financial Supply of State Social Expenditures: The «Human-Center» Tax Principle

Larysa Vasyurenko *, Ihor Kuksa, Valerii Danylenko, Valeriia Ostashova, Liubov Kysliuk, Olena Naholiuk, Maksym Sukhoruchenko

International Journal of Supply and Operations Management, Volume:8 Issue: 4, Autumn 2021, PP 370 -380

Comprehensive economic development is possible only with the balance of interests of business entities and the state, which should be reflected in financial policy. In this case, the transformation of the fiscal system should take into account the stage of economic development of the country. An information array consisting of 36 countries and 10 socio-economic indicators was adopted as the basis for the development of benchmarks for assessing the effectiveness of public resources for the implementation of social policies in the region. The basic features which characterize the state of social orientation of the state policy of the countries in correlation of the spheres of social expenditures and the national system of taxation as social arguments are outlined. Comparative intercluster characteristics are identified and essential differential and baseline characteristics are distinguished. In order to determine the rationality and effectiveness of the current tax system and its impact in the field of social guarantees of the state as well as to increase the degree of social protection of the most needy population, a methodological approach was proposed, using a multidimensional statistical procedure, cluster ranking, which allows the grouping of objects on several grounds simultaneously to define main characteristics of the studied world economies for simulation of “bench marking” – system of financial support of state social expenditures, built on the principle of "human-center" taxation.

Keywords: Finance, Supply management, Taxation, Social expenditures, Clustering, State policy}

Abstract View Paper Research/Original Article Original: English
A New Multi-Stage Feature Selection and Classification Approach: Bank Customer Credit Risk Scoring

Farshid Abdi *

Journal Of Industrial Engineering International, Volume:17 Issue: 1, Winter 2021, PP 78 -87

Lots of information about customers are stored in the databases of banks. These databases can be used to assess the credit risk. Feature selection is a well-known concept to reduce the dimension of such databases. In this paper, a multi-stage feature selection approach is proposed to reduce the dimension of database of an Iranian bank including 50 features. The first stage of this paper is devoted to removal of correlated features. The second stage of it is allocated to select the important features with genetic algorithm. The third stage is proposed to weight the variables using different filtering methods. The fourth stage selects feature through clustering algorithm. Finally, selected features are entered into the K-nearest neighbor (K-NN) and Decision Tree (DT) classification algorithms. The aim of the paper is to predict the likelihood of risk for each customer based on effective and optimum subset of features available from the customers.

Keywords: Clustering, Credit risk prediction, filtering method, Genetic Algorithm, Hybrid feature selection}

Abstract View Paper Research/Original Article Original: English
Robust Cluster-Based method for monitoring generalized linear profiles in phase I

Davood Saremian, Rassoul Noorossana *, Sadigh Raissi, Paria Soleimani

Journal Of Industrial Engineering International, Volume:17 Issue: 1, Winter 2021, PP 88 -97

Profile monitoring is one of the new statistical quality control methods used to evaluate the functional relationship between the descriptive and response variables to measure the process quality. Most of the studies in this field concern processes whose response variables follow the normal distribution function, but in many industries and services, this assumption is not true. The presence of outliers in the historical data set could have a deleterious effect on phase I parameter estimation. Therefore, in this paper, we propose a robust cluster-based method for estimating the parameters of generalized linear profiles in phase I. In this method, the effect of data contamination on estimating the generalized linear model parameters is reduced and as a result, the performance of T^2 control charts is improved. The performance of this method has been evaluated for two specific modes of generalized linear profiles, including logistic and Poisson profiles, based on a step shift. The simulation results indicate the superiority of this cluster-based method in comparison to the non-clustering method and provide a more accurate estimation of the parameters.

Keywords: Generalized linear models, Phase I, Hotelling T^2, Clustering, robust}

Abstract View Paper Research/Original Article Original: English
ترکیب الگوریتم های جهش قورباغه یی و کرم شب تاب در توسعه الگوریتم مسیریابی چندگامی شبکه های حسگر بی سیم

امیرحسین برزین، احمد صادقیه*، حسن خادمی زارع، محبوبه هنرور

مجله مهندسی صنایع و مدیریت شریف، سال سی و ششم شماره 2 (پاییز و زمستان 1399)، صص 33 -51

محدودیت توان و انرژی در گره‌های حسگر ساختار شبکه‌های حسگر بی‌سیم، طراحی پروتکل مسیریابی کارا انرژی را برای انجام موثر وظایف ارتباطی و پردازشی در دامنه‌ی هدف و بهبود طول عمر، با اهمیت می‌کند. خوشه‌بندی روشی پذیرفته شده برای کارایی انرژی در این شبکه‌هاست. بیشینه‌سازی طول عمر شبکه‌های حسگر بی‌سیم مسیله‌یی NP-hard است. لذا به کمک فراابتکاری‌ها تحقیقات گسترده‌یی برای حل آن انجام شده است. در این نوشتار، الگوریتم مسیریابی چندگامی مبتنی بر خوشه‌بندی از ترکیب الگوریتم جهش قورباغه‌یی و الگوریتم کرم شب‌تاب به‌نام MOFSA پیشنهاد می‌شود. در این رویکرد ابتدا برای یافتن سرخوشه‌ها در فاز خوشه‌بندی و سپس برای یافتن گره‌های باز فرستنده در فاز مسیریابی چندگامی، دو تابع برازندگی چندهدفه ارایه می‌شود. نتایج شبیه‌سازی و مقایسه‌ی عملکرد الگوریتم با پروتکل‌های مسیریابی موجود افزایش شاخص‌های طول عمر شبکه را تا 230 درصد نسبت به LEACH،100 درصد نسبت به EAR، 38 درصد نسبت به SIF و 260 درصد نسبت به FSFLA در سناریوهای پیشنهادی نشان داد.

کلید واژگان: شبکه های حسگر بی سیم, خوشه بندی, مسیریابی چندگامی, الگوریتم جهش قورباغه یی, الگوریتم کرم شب تاب}

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Multihop Clustering Based Routing for Wireless Sensor Networks: A Hybrid Swarm intelligence Based Approach

A.H. Barzin, A. Sadeghieh*, Hassan khademi zare, mahboobeh honarvar

Industrial Engineering & Management Sharif, Volume:36 Issue: 2, 2021, PP 33 -51

Wireless sensor networks (WSN) comprise of a large number of low-power but low-cost small sensing nodes which distributed randomly in a specific area far from the human reach , for the purpose of surveillance, recognition and monitoring the nearby environment based on their inter communication. Each node includes units i.e. sensing, processing, transducing, location positioning and power supply. Owing to various features of sensors such as quickness, self-awareness and self configurability, WSNs have various applications in different areas and many methods are being developed to improve their performance in an application specific way. WSNs face many challenges, including energy restrictions, security, communication reliability, design, and so on. It should be mentioned that it is hardly possible to balance all these challenges due to the conflicts they have with each other. Hitherto, researchers have done extensive studies to bridle these concerns. Sensor nodes are small and have often limited and irreplaceable sources of energy. Furthermore, they can send information at short distances. In long run operations, each node generally does the data collection singly. In this paper, a multi-objective swarm intelligence-based algorithm built on Shuffled frog-leaping and Firefly Algorithm (named MOFSA) is presented as an adaptive clustering-based multi-hop routing protocol for WSNs. MOFSA's multi-objective function regards different criteria (e.g., inter- and intra-cluster distances, residual energy of nodes, distances from the sink, overlap and load of clusters) to select appropriate cluster heads at each round. Moreover, another multi-objective function is proposed to select the forwarder nodes in the routing phase. The controllable parameters of MOFSA in both clustering and multi-hop phases can be adaptively tuned to achieve the best performance based on the network requirements according to the specific application. Simulation outcomes demonstrate average lifetime improvements of 230% compared with LEACH, 100% compared with ERA, 38% compared with SIF and 260% compared with FSFLA in different network scenarios.

Keywords: Wireless Sensor Networks, Clustering, Multi-hop routing, Shuffled Frog Leaping Algorithm, Firefly algorithm}

Abstract View Paper Research/Original Article Original: Persian
Solving a bi-objective medicine distribution problem considering delivery to waste center using a hybrid clustering, mathematical modeling and NSGA-II approach

Masoud Rabani *, Dorsa Abdolhamidi, Mahdi Mokhtarzadeh, Soroush Fatemi Anaraki

Journal of Industrial and Systems Engineering, Volume:13 Issue: 2, Spring 2021, PP 245 -263

Proper transportation and distribution of commodities plays a pivotal role in the expenditures of supply chains. In this paper, a clustered vehicle routing problem with pick-up and delivery is studied. A fleet of distinct vehicles is concurrently responsible for distribution of medicines and collection of their wastes. Collected wastes should be sent to a waste center. To solve the problem, a bi-objective mathematical model is presented. Fairness of travelled distances among drivers and transportation expenses are two objective functions considered in the model. Since the proposed problem is NP-hard, a three-step hybrid approach is developed to solve the problem. First, K-medoids clustering algorithm allocates customers to subsets based on their coordinates. Second, a mathematical model is used for routing vehicles within each cluster. Third, NSGA-II is used to produce final result using the outcome of step 2. Extensive numerical results indicate the superiority of the proposed approach against the NSGA-II.

Keywords: VRP, Fairness, delivery to disposal center, Clustering, NSGA-II}

Abstract View Paper Research/Original Article Original: English
A Data Mining approach for forecasting failure root causes: A case study in an Automated Teller Machine (ATM) manufacturing company

Seyedehpardis Bagherighadikolaei, Rouzbeh Ghousi *, Abdolrahman Haeri

Journal of Optimization in Industrial Engineering, Volume:13 Issue: 28, Summer and Autumn 2020, PP 101 -121

Based on the findings of Massachusetts Institute of Technology, organizations’ data double every five years. However, the rate of using data is 0.3. Nowadays, data mining tools have greatly facilitated the process of knowledge extraction from a welter of data. This paper presents a hybrid model using data gathered from an ATM manufacturing company. The steps of the research are based on CRISP-DM. Therefore, based on the first step, business understanding, the company and its different units were studied. After business understanding, the data collected from sale's unit were prepared for preprocess. While preprocessing, data from some columns of dataset, based on their types and purpose of the research, were either categorized or coded. Then, the data have been inserted into Clementine software, which resulted in modeling and pattern discovery. The results clearly state that, the same Machines’ Code and the same customers in different provinces are struggling with significantly different Problems’ Code, that could be due to weather condition, culture of using ATMs, and likewise. Moreover, the same Machines’ Code and the same Problems’ Code, as well as differences in Technicians' expertise, seems to be some causes to significantly different Repair Time. This could be due to Technicians' training background level of their expertise and such. At last, the company can benefit from the outputs of this model in terms of its strategic decision-making.

Keywords: Data mining, Clustering, Association rules, Classification, Automated Teller Machine (ATM)}

Abstract View Paper Case Study Original: English

نکته

نتایج بر اساس تاریخ انتشار مرتب شده‌اند.
کلیدواژه مورد نظر شما تنها در فیلد کلیدواژگان مقالات جستجو شده‌است. به منظور حذف نتایج غیر مرتبط، جستجو تنها در مقالات مجلاتی انجام شده که با مجله ماخذ هم موضوع هستند.
در صورتی که می‌خواهید جستجو را در همه موضوعات و با شرایط دیگر تکرار کنید به صفحه جستجوی پیشرفته مجلات مراجعه کنید.

به جمع مشترکان مگیران بپیوندید!

جستجوی مقالات مرتبط با کلیدواژه « Clustering » در نشریات گروه « صنایع »