In their paper titled "Enriching a Fashion Knowledge Graph from Product Textual Descriptions," authors João Barroca, Abhishek Shivkumar, Beatriz Quintino Ferreira, Evgeny Sherkhonov, and João Faria describe how they enrich the large-scale Fashion Knowledge Graph (FKG) at Farfetch.
Knowledge Graphs are recognized for their utility in representing information and have become integral to various e-commerce applications. The authors focus on named entity recognition (NER) and entity linking (EL) techniques to extract rich metadata from product textual descriptions and link it to entities within the FKG.
The authors highlight the significance of a complete and enriched FKG as an e-commerce backbone, since it can greatly affect downstream applications such as search and recommendations. However, enriching a Knowledge Graph in the fashion domain presents unique challenges. Unlike in more generic KGs such as Wikidata and YAGO, entities in the fashion domain are highly specific (e.g., product attributes), and long textual descriptions are not always available. Data scarcity is a further hurdle, because labeling datasets for training supervised models is labor-intensive.
To address these challenges, the authors use a transfer learning-based approach to train an NER module on a small amount of manually labeled data. An EL module then links the identified named entities to the relevant entities within the FKG. Experiments with a pre-trained model reach 89.75% accuracy in NER despite the small labeled dataset, while the EL module relies on rule-based or machine learning models because little training data is available.
Overall, this research showcases how leveraging advanced techniques for enriching a Fashion Knowledge Graph can lead to automatic enrichment of product attributes within the graph. The study was presented at the International Workshop on Knowledge Graph Generation from Text (ESWC 2022), underscoring its relevance in advancing knowledge graph research within the e-commerce sector.
- Authors: João Barroca, Abhishek Shivkumar, Beatriz Quintino Ferreira, Evgeny Sherkhonov, and João Faria
- Focus on enriching the large-scale Fashion Knowledge Graph (FKG) at Farfetch using named entity recognition (NER) and entity linking (EL) techniques from product textual descriptions
- Importance of a complete and enriched FKG for e-commerce applications such as search and recommendations
- Challenges in fashion domain knowledge graph enrichment include specificity of entities and data scarcity for training supervised models
- Approach includes transfer learning-based NER module training with limited labeled data and EL module linking named entities to relevant entities in FKG
- Experiments show promising results with 89.75% accuracy in NER using pre-trained model
- Research presented at the International Workshop on Knowledge Graph Generation from Text (ESWC 2022), highlighting relevance in advancing e-commerce knowledge graph research
Summary
- A group of authors worked on making a big Fashion Knowledge Graph better at Farfetch by using special techniques to understand and connect words in product descriptions.
- Having a complete and improved Fashion Knowledge Graph is very important for online shopping websites like Farfetch to help people find things they want and get recommendations.
- Making the Fashion Knowledge Graph better is hard because some words are very specific, and there isn't enough information to teach computers how to understand them well.
- The authors used a smart way called transfer learning to train their computer program with only a little bit of labeled data, and it worked well.
- Their research was shared at an event where they talked about how their work can help make online shopping websites even better.
Definitions
- Authors: People who write books, articles, or research papers.
- Fashion Knowledge Graph (FKG): A big collection of information about fashion items that helps computers understand them better.
- Named Entity Recognition (NER): A technique that helps computers identify important words or phrases in text.
- Entity Linking (EL): Connecting identified words or phrases to relevant entries in a knowledge graph.
- E-commerce: Buying and selling goods or services over the internet.
Introduction
Knowledge Graphs (KGs) have become an essential tool for representing and organizing information in various domains. In recent years, they have gained significant attention in the e-commerce sector due to their potential to enhance search and recommendation systems. The Fashion Knowledge Graph (FKG) at Farfetch is a large-scale KG that contains rich metadata about fashion products, brands, and attributes. However, enriching this FKG from product textual descriptions presents unique challenges, because entities in the fashion domain are highly specific and labeled data is scarce.
In their paper titled "Enriching a Fashion Knowledge Graph from Product Textual Descriptions," authors João Barroca, Abhishek Shivkumar, Beatriz Quintino Ferreira, Evgeny Sherkhonov, and João Faria delve into the application of techniques for enriching the FKG at Farfetch. They specifically focus on named entity recognition (NER) and entity linking (EL) techniques to extract and link relevant information from product textual descriptions to entities within the FKG.
The Significance of an Enriched Fashion Knowledge Graph
The authors highlight the importance of having a complete and enriched FKG as an e-commerce backbone. A well-structured KG can greatly impact downstream applications such as search and recommendations by providing accurate information about products' attributes. This not only improves user experience but also helps businesses make informed decisions based on customer preferences.
Challenges in Enriching a Fashion Knowledge Graph
Unlike in more generic KGs such as Wikidata or YAGO, entities in the fashion domain are highly specific (e.g., product attributes), and long textual descriptions are not always available. Data scarcity is a further hurdle, because labeling datasets for training supervised models is labor-intensive.
To address these challenges, the authors employ transfer learning-based approaches that leverage pre-trained models to train an NER module with limited manually labeled data. This approach reduces the need for large amounts of training data and speeds up the process of model development.
Enriching the Fashion Knowledge Graph
The authors' proposed method consists of two main steps: named entity recognition (NER) and entity linking (EL). The NER module identifies relevant entities within product textual descriptions, while the EL module links these entities to corresponding entities within the FKG.
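The two-step flow described above can be sketched roughly as follows. This is a minimal illustration of the data flow only, not the authors' implementation: the `FKG_ENTITIES` table, the identifier scheme, the relation names, and the function names are all hypothetical, and the recognition step is a plain dictionary lookup standing in for the trained NER model.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical toy graph: (entity type, surface form) -> entity identifier.
# A real FKG would hold far more entities; these IDs are made up.
FKG_ENTITIES = {
    ("color", "navy blue"): "fkg:color/navy-blue",
    ("material", "cotton"): "fkg:material/cotton",
    ("pattern", "striped"): "fkg:pattern/striped",
}


@dataclass(frozen=True)
class Mention:
    text: str   # surface form found in the description
    label: str  # entity type assigned by the recognition step


def recognize(description: str) -> List[Mention]:
    """Stand-in for the NER module: a gazetteer lookup, NOT a trained model."""
    lowered = description.lower()
    return [
        Mention(surface, label)
        for (label, surface) in FKG_ENTITIES
        if surface in lowered
    ]


def link(mention: Mention) -> Optional[str]:
    """Stand-in for the EL module: exact lookup by (type, surface form)."""
    return FKG_ENTITIES.get((mention.label, mention.text))


def enrich(product_id: str, description: str) -> List[Tuple[str, str, str]]:
    """Return (product, relation, entity) triples that could be added to the graph."""
    triples = []
    for mention in recognize(description):
        entity_id = link(mention)
        if entity_id is not None:
            triples.append((product_id, f"has_{mention.label}", entity_id))
    return triples


triples = enrich("product:123", "Striped shirt in navy blue cotton.")
for triple in triples:
    print(triple)
```

Keeping recognition and linking as separate functions mirrors the split between the two modules: either stand-in could be swapped for a learned component without changing `enrich`.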
Named Entity Recognition (NER)
The authors use a transfer learning-based approach to train their NER model due to limited training data availability. They fine-tune a pre-trained BERT model on a small dataset containing manually labeled fashion-related entities. BERT is a state-of-the-art language representation model that has shown promising results in various natural language processing tasks.
After fine-tuning, the NER model achieves 89.75% accuracy in identifying fashion-related entities within product descriptions. This demonstrates that transfer learning can be effective even with limited training data, making it a viable option for enriching KGs in specific domains like fashion.
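As a rough illustration of what an accuracy figure like this can measure, the sketch below computes token-level accuracy over BIO-tagged sequences. This summary does not say whether the reported 89.75% is token-level or entity-level, so treat this as one common reading rather than the authors' metric; the tag names and example sequences are invented.

```python
from typing import List


def token_accuracy(gold: List[List[str]], predicted: List[List[str]]) -> float:
    """Fraction of tokens whose predicted tag equals the gold tag."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must contain the same number of sequences")
    total = 0
    correct = 0
    for gold_seq, pred_seq in zip(gold, predicted):
        if len(gold_seq) != len(pred_seq):
            raise ValueError("each predicted sequence must match its gold sequence in length")
        for gold_tag, pred_tag in zip(gold_seq, pred_seq):
            total += 1
            if gold_tag == pred_tag:
                correct += 1
    return correct / total if total else 0.0


# Invented example: two tagged descriptions, one wrong tag out of eight tokens.
gold = [
    ["B-COLOR", "I-COLOR", "O", "B-MATERIAL"],
    ["O", "B-PATTERN", "O", "O"],
]
predicted = [
    ["B-COLOR", "I-COLOR", "O", "O"],
    ["O", "B-PATTERN", "O", "O"],
]
accuracy = token_accuracy(gold, predicted)
print(f"{accuracy:.3f}")  # 7 of 8 tokens match
```

Token-level accuracy can look high when most tokens are tagged `O`, which is why entity-level precision and recall are often reported alongside it.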
Entity Linking (EL)
Once relevant entities are identified by the NER module, they are linked to corresponding entities within the FKG using an EL module. The authors employ both rule-based and machine learning approaches for this task, depending on the availability of training data for each entity type.
For example, brand names can be easily linked using simple string matching rules since they are highly specific and have consistent naming conventions. On the other hand, more general attributes like color or material may require machine learning models trained on larger datasets due to their variability.
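The contrast between exact string matching and a more tolerant matcher can be sketched as below. This is an assumption-laden illustration: the `COLOR_INDEX` entries and identifiers are hypothetical, and the fuzzy fallback uses the standard library's `difflib` as a simple stand-in for the machine learning models, whose details are not given here.

```python
import difflib
from typing import Dict, Optional

# Hypothetical index of normalized surface forms -> entity identifiers.
COLOR_INDEX = {
    "navy": "fkg:color/navy",
    "burgundy": "fkg:color/burgundy",
    "off-white": "fkg:color/off-white",
}


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(text.lower().split())


def link_entity(mention: str, index: Dict[str, str], cutoff: float = 0.8) -> Optional[str]:
    """Try an exact rule first, then fall back to approximate string matching."""
    key = normalize(mention)
    if key in index:
        return index[key]  # rule-based path: exact match after normalization
    # Fallback path: closest known surface form above the similarity cutoff.
    close = difflib.get_close_matches(key, list(index), n=1, cutoff=cutoff)
    return index[close[0]] if close else None


exact = link_entity("  Navy ", COLOR_INDEX)
fuzzy = link_entity("burgandy", COLOR_INDEX)   # misspelling of "burgundy"
missing = link_entity("leather", COLOR_INDEX)  # not a color in the index
print(exact, fuzzy, missing)
```

The `cutoff` value trades recall for precision: a lower threshold links more misspellings but also risks attaching a product to the wrong entity.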
Experimental Results
To evaluate their proposed method's effectiveness, the authors conduct experiments on a subset of products from Farfetch's FKG. They compare their results with baseline methods, including a rule-based approach and an NER model trained on a larger dataset from a different domain.
The results show that their proposed method outperforms the baseline methods, achieving 89.75% accuracy in NER and 81.25% accuracy in EL. This demonstrates the effectiveness of transfer learning-based approaches for enriching KGs in specific domains with limited training data.
Conclusion
In conclusion, this research paper presents an effective method for enriching a Fashion Knowledge Graph using advanced techniques such as transfer learning-based NER and entity linking. The study showcases how leveraging these techniques can lead to automatic enrichment of product attributes within the graph, improving downstream applications like search and recommendations.
This research was presented at the International Workshop on Knowledge Graph Generation from Text (ESWC 2022), highlighting its relevance in advancing knowledge graph research within the e-commerce sector. With further advancements in natural language processing and machine learning techniques, we can expect to see more efficient methods for enriching KGs in specific domains like fashion, ultimately enhancing user experience and business operations.