Machine Learning

Cluster social media content and find out which are closedly related.

Project Background

For better performance, business has the need to understand how the content are related to each other among emojis, hashtags and mentions in social media. Therefore Machine Learning is used for the analysis of similarity and dissimilarity.

Below are some ML analysis result, showing the distance among the content components in the same category.

Challenges & Requirements

To ensure a thorough analysis, it is essential to examine the content based on different categories. This analysis can be made even more insightful by utilizing customized metrics such as the engagement index, which measures how attractive the content is to the audience. Other important metrics to consider include the number of followers the post owner has, the comments received on the post, and the number of likes it has garnered. By clustering the data points into distinct groups, we can identify similarities within each group and distinguish them from other groups, enabling a comprehensive understanding of the data.

Actions & Outcomes

The K-means clustering algorithm is widely recognized as a fundamental unsupervised machine learning technique. By following a series of steps, we are able to generate accurate and insightful results.

The results imply that certain components of the content have a propensity to group together and display similarities.

Incredible developer. I ran late on the information at times but communicated very well and followed up everything perfectly. Amazing developer! 5 stars is not enough!!!

Technologies Used

The data is extracted from BigQuery and transformed into TFRecord format. The K-Means Cluster algorithm is orchestrated using Tensorflow in Python. The model is executed using AWS Jupyter notebook.

Conclusions

By gaining a deeper comprehension of the similarities between content components, we can enhance the performance of our posts by connecting them to similar elements, thereby creating a cohesive narrative. Additionally, we can expand our reach by engaging with distant content components to attract new audiences. As this field is relatively unexplored, it may be necessary to conduct further research and analyze larger data sets in order to validate these findings.

Interested in hiring me for your project?

Looking for an experienced full-stack developer to build your web app or ship your software product? To start an initial chat, just drop me an email at vincent.zhu@storytreat.com or use the form on the contact page.