Using Evolutionary Clustering for Topic Detection in Microblogging Considering Social Network Information

Author(s):

E. Alavi , H. Mashayekhi* , H. Hassanpour , B. Rahimpour Kami

Message:

Article Type:

Research/Original Article (دارای رتبه معتبر)

Abstract:

Short texts of social media like Twitter provide a lot of information about hot topics and public opinions. For better understanding of such information, topic detection and tracking is essential. In many of the available studies in this field, the number of topics must be specified beforehand and cannot be changed during time. From this perspective, these methods are not suitable for increasing and dynamic data. In addition, non-parametric topic evolution models lack appropriate performance on short texts due to the lack of sufficient data. In this paper, we present a new evolutionary clustering algorithm, which is implicitly inspired by the distance-dependent Chinese Restaurant Process (dd-CRP). In the proposed method, to solve the data sparsity problem, social networking information along with textual similarity has been used to improve the similarity evaluation between the tweets. In addition, in the proposed method, unlike most methods in this field, the number of clusters is calculated automatically. In fact, in this method, the tweets are connected with a probability proportional to their similarity, and a collection of these connections constitutes a topic. To speed up the implementation of the algorithm, we use a cluster-based summarization method. The method is evaluated on a real data set collected over two and a half months from the Twitter social network. Evaluation is performed by clustering the texts and comparing the clusters. The results of the evaluations show that the proposed method has a better coherence compared to other methods, and can be effectively used for topic detection from social media short texts.

Keywords:

Topic detection , evolutionary clustering , social networks , probabilistic model

Language:

Persian

Published:

Iranian Journal of Electrical and Computer Engineering, Volume:17 Issue: 4, 2020

Pages:

277 to 286

https://magiran.com/p2097699

دانلود و مطالعه متن این مقاله با یکی از روشهای زیر امکان پذیر است:

اشتراک شخصی

با ثبت ایمیلتان و پرداخت حق اشتراک سالانه به مبلغ 1,950,000ريال، بلافاصله متن این مقاله را دریافت کنید.اعتبار دانلود 70 مقاله نیز در حساب کاربری شما لحاظ خواهد شد.

پرداخت حق اشتراک به معنای پذیرش "شرایط خدمات" پایگاه مگیران از سوی شماست.

پست الکترونیکی

اگر مقاله ای از شما در مگیران نمایه شده، برای استفاده از اعتبار اهدایی سامانه نویسندگان با ایمیل منتشرشده ثبت نام کنید. ثبت نام

اشتراک سازمانی

به کتابخانه دانشگاه یا محل کار خود پیشنهاد کنید تا اشتراک سازمانی این پایگاه را برای دسترسی نامحدود همه کاربران به متن مطالب تهیه نمایند!

اطلاعات بیشتر ثبت نام با ایمیل دانشگاهی/سازمانی

توجه!

حق عضویت دریافتی صرف حمایت از نشریات عضو و نگهداری، تکمیل و توسعه مگیران می‌شود.
پرداخت حق اشتراک و دانلود مقالات اجازه بازنشر آن در سایر رسانه‌های چاپی و دیجیتال را به کاربر نمی‌دهد.

In order to view content subscription is required

Personal subscription

Subscribe magiran.com for 70 € euros via PayPal and download 70 articles during a year.

Organization subscription

Please contact us to subscribe your university or library for unlimited access!

More information

علمی مصوب

نشریه مهندسی برق و مهندسی کامپیوتر ایران

Iranian Journal of Electrical and Computer Engineering

فصلنامه فنی مهندسی

آخرین شماره | آرشیو

ISSN: 1682-3745

صاحب امتیاز:

جهاد دانشگاهی

مدیر مسئول:

دکتر حمیدرضا طیبی

سردبیر:

دکتر حمیدرضا صادق محمدی

تلفن نشریه: ۰۲۱-۷۷۸۹۶۶۸۸

اطلاعات بیشتر نشریه

درباره نشریه پیام به نشریه سایت اختصاصی نشریه

به جمع مشترکان مگیران بپیوندید!

Using Evolutionary Clustering for Topic Detection in Microblogging Considering Social Network Information

E. Alavi , H. Mashayekhi* , H. Hassanpour , B. Rahimpour Kami

Topic detection , evolutionary clustering , social networks , probabilistic model

نشریه مهندسی برق و مهندسی کامپیوتر ایران

Iranian Journal of Electrical and Computer Engineering