AI-Powered Data Solutions
AI-Powered Data Solutions

Analyzing VDI Recordings to Detect Unauthorized Access

A financial institution identified that certain sales personnel had exploited system vulnerabilities to access confidential customer information. With access logs for some periods no longer available, the company opted to review Virtual Desktop Infrastructure (VDI) recordings from over 5,000 employees over the past one year to identify suspicious activity and quantify the scope of the data breach.

Primidea deployed advanced machine vision algorithms to tackle the vast dataset efficiently. By analyzing system interface features and combining template matching with object detection, we accurately pinpointed relevant screen content. Optical Character Recognition (OCR) technology was then employed to extract critical data fields from the identified frames.

This approach successfully revealed dozens of employees involved in unauthorized access and uncovered more than 10,000 compromised customer records. Our solution provided the client with comprehensive insights to strengthen their data security framework and enhance compliance protocols.

Detecting Bribery Evidence from Chat App Image Caches

A whistleblower alerted a multinational company to allegations that over ten employees were involved in leaking and selling confidential company data. The company launched an internal investigation and engaged Primidea to analyze the employees’ electronic data.

Upon examining the work computers of the implicated employees, Primidea uncovered hundreds of thousands, even millions, of cached images on each device. A manual review of this scale would have been highly inefficient and risked overlooking critical evidence. To streamline the process, Primidea’s technical team implemented an AI-powered image classification system.

The team first categorized and prioritized images most likely to contain relevant evidence, identified key categories, and created a robust training dataset. Leveraging this dataset, we developed a precise image classification model. With AI, investigators were able to review the immense volume of images in just a few days.

The investigation uncovered pivotal evidence, including:

  • Screenshots of employees sharing company data via WeChat.
  • Screenshots of employees receiving bribes through WeChat or bank transfers.

These findings were instrumental in helping the company address the breaches and mitigate further losses, demonstrating the power of AI in streamlining complex investigations.

Primidea’s Technical Expertise

Shanghai Primidea Technology Co., Ltd. (“Primidea”) has dedicated years to advancing machine learning through rigorous research. Its team has consistently excelled in prestigious international AI competitions.

Primidea’s team members have demonstrated exceptional problem-solving capabilities in top-tier contests on platforms such as Kaggle and Alibaba Cloud Tianchi, tackling complex challenges in computer vision and data mining. Among its achievements, the team earned a global ranking of 59th in Kaggle’s overall competition standings—a testament to its expertise and dedication.

With extensive experience in AI algorithm design and development, Primidea continues to push the boundaries of innovation. The company is committed to providing clients with cutting-edge, precise, and efficient technical solutions tailored to their needs.

Selected Representative Competition Achievements

Project Award Platform
1 CVPR  Image Matching Challenge 2022 Gold Medal Kaggle
2 ION GNSS+ Google Smartphone Decimeter Challenge 2022 Gold Medal Kaggle
3 UNiLAB Algorithm Competition:SRP for high-frequency load data Runner-Up Tianchi

Overview of Applied Technologies

By integrating advanced artificial intelligence techniques such as Computer Vision, Signal Processing, Pattern Recognition, Natural Language Processing, and Data Mining, Primidea provides in-depth analysis and efficient processing of multimodal data.

Image and Video

  • Image Classification: Extracts features of objects or scenes in images and assigns them to predefined categories.
  • Image Retrieval: Includes reverse image search and text-based image search by extracting features from images or textual descriptions and computing their similarity to images in a database for quick retrieval.
  • Face Recognition: Identifies or verifies individual identities by analyzing facial features in images.
  • Image Tampering Detection: Detects traces of edits or forgeries in images to ensure their authenticity.
  • Deepfake Detection: Uses AI to identify and prevent deepfake content in images, videos, or audio, safeguarding against fraud and misinformation.
  • Object Detection and Semantic Segmentation: Identifies and locates objects in images or videos, marking regions and assigning classifications. Semantic segmentation further analyzes images on a pixel level to assign category labels to each pixel.
  • Object Tracking: Tracks the spatial movement of one or more objects in a sequence of image frames.
  • Pose Estimation: Estimates spatial positions of key points, such as joints or skeletal structures, in humans or objects from images or videos.
  • Video Event Detection: Analyzes sequences of video frames to identify clips related to predefined events or actions.

Audio

  • Automatic Speech Recognition (ASR): Converts human speech into readable text or commands, accommodating various languages, accents, and noisy environments.
  • Speech Sentiment Analysis: Analyzes tone, speed, and other vocal characteristics to identify emotional states, aiding in voice interactions and emotion detection.
  • Voiceprint/Speaker Recognition: Verifies or identifies individual identities through unique vocal characteristics.
  • Audio Event Detection: Detects and classifies specific sound events, such as alarms or traffic noise, by analyzing audio signal features.

Text

  • Optical Character Recognition (OCR): Extracts textual information from images and converts it into editable text, enabling document digitization and processing.
  • Information Extraction: Automatically extracts structured data from large volumes of text, identifying entities, relationships, and events for analysis and decision-making.
  • Sentiment Analysis: Classifies the emotional tone of text to understand customer feedback, social media sentiment, and more.
  • Automatic Summarization: Uses natural language processing to extract key points from text and generate concise summaries.
  • Text Retrieval: Matches and retrieves text based on similarity scores, enabling quick discovery of relevant content in large-scale text databases.

Other Capabilities

  • Knowledge Graph: Structures entities and their relationships in graph data, enabling complex relational analysis and knowledge expansion through inference techniques like graph neural networks.