The post GitHub Copilot Enhances Code Search with New Embedding Model appeared on BitcoinEthereumNews.com. Ted Hisokawa Sep 26, 2025 03:41 GitHub introduces a new Copilot embedding model, enhancing code search in VS Code with improved accuracy and efficiency, according to GitHub’s announcement. GitHub has announced a significant upgrade to its Copilot tool, introducing a new embedding model that promises to enhance code search within Visual Studio Code (VS Code). This development aims to make code retrieval faster, more memory-efficient, and significantly more accurate, as detailed in a recent GitHub blog post. Enhanced Code Retrieval The new Copilot embedding model brings a 37.6% improvement in retrieval quality, doubling the throughput and reducing the index size by eight times. This means developers can expect more accurate code suggestions, faster response times, and reduced memory usage in VS Code. The model effectively provides the correct code snippets needed, minimizing irrelevant results. Why the Upgrade Matters Efficient code search is crucial for a seamless AI coding experience. Embeddings, which are vector representations, play a key role in retrieving semantically relevant code and natural language content. The improved embeddings result in higher retrieval quality, thereby enhancing the overall GitHub Copilot experience. Technical Improvements GitHub has trained and deployed this new model specifically for code and documentation, enhancing context retrieval for various Copilot modes. The update has shown significant improvements, with C# developers experiencing a 110.7% increase in code acceptance ratios and Java developers seeing a 113.1% rise. Training and Evaluation The model was optimized using contrastive learning techniques, such as InfoNCE loss and Matryoshka Representation Learning, to improve retrieval quality. A key aspect of the training involved using ‘hard negatives’—code examples that appear correct but are not—helping the model distinguish between nearly correct and actually correct code snippets. Future Prospects GitHub plans to expand its training and evaluation data to include… The post GitHub Copilot Enhances Code Search with New Embedding Model appeared on BitcoinEthereumNews.com. Ted Hisokawa Sep 26, 2025 03:41 GitHub introduces a new Copilot embedding model, enhancing code search in VS Code with improved accuracy and efficiency, according to GitHub’s announcement. GitHub has announced a significant upgrade to its Copilot tool, introducing a new embedding model that promises to enhance code search within Visual Studio Code (VS Code). This development aims to make code retrieval faster, more memory-efficient, and significantly more accurate, as detailed in a recent GitHub blog post. Enhanced Code Retrieval The new Copilot embedding model brings a 37.6% improvement in retrieval quality, doubling the throughput and reducing the index size by eight times. This means developers can expect more accurate code suggestions, faster response times, and reduced memory usage in VS Code. The model effectively provides the correct code snippets needed, minimizing irrelevant results. Why the Upgrade Matters Efficient code search is crucial for a seamless AI coding experience. Embeddings, which are vector representations, play a key role in retrieving semantically relevant code and natural language content. The improved embeddings result in higher retrieval quality, thereby enhancing the overall GitHub Copilot experience. Technical Improvements GitHub has trained and deployed this new model specifically for code and documentation, enhancing context retrieval for various Copilot modes. The update has shown significant improvements, with C# developers experiencing a 110.7% increase in code acceptance ratios and Java developers seeing a 113.1% rise. Training and Evaluation The model was optimized using contrastive learning techniques, such as InfoNCE loss and Matryoshka Representation Learning, to improve retrieval quality. A key aspect of the training involved using ‘hard negatives’—code examples that appear correct but are not—helping the model distinguish between nearly correct and actually correct code snippets. Future Prospects GitHub plans to expand its training and evaluation data to include…

GitHub Copilot Enhances Code Search with New Embedding Model

2025/09/27 19:29


Ted Hisokawa
Sep 26, 2025 03:41

GitHub introduces a new Copilot embedding model, enhancing code search in VS Code with improved accuracy and efficiency, according to GitHub’s announcement.





GitHub has announced a significant upgrade to its Copilot tool, introducing a new embedding model that promises to enhance code search within Visual Studio Code (VS Code). This development aims to make code retrieval faster, more memory-efficient, and significantly more accurate, as detailed in a recent GitHub blog post.

Enhanced Code Retrieval

The new Copilot embedding model brings a 37.6% improvement in retrieval quality, doubling the throughput and reducing the index size by eight times. This means developers can expect more accurate code suggestions, faster response times, and reduced memory usage in VS Code. The model effectively provides the correct code snippets needed, minimizing irrelevant results.

Why the Upgrade Matters

Efficient code search is crucial for a seamless AI coding experience. Embeddings, which are vector representations, play a key role in retrieving semantically relevant code and natural language content. The improved embeddings result in higher retrieval quality, thereby enhancing the overall GitHub Copilot experience.

Technical Improvements

GitHub has trained and deployed this new model specifically for code and documentation, enhancing context retrieval for various Copilot modes. The update has shown significant improvements, with C# developers experiencing a 110.7% increase in code acceptance ratios and Java developers seeing a 113.1% rise.

Training and Evaluation

The model was optimized using contrastive learning techniques, such as InfoNCE loss and Matryoshka Representation Learning, to improve retrieval quality. A key aspect of the training involved using ‘hard negatives’—code examples that appear correct but are not—helping the model distinguish between nearly correct and actually correct code snippets.

Future Prospects

GitHub plans to expand its training and evaluation data to include more languages and repositories. The company is also refining its hard negative mining pipeline to enhance quality further, with goals to deploy larger, more accurate models leveraging the efficiency gains from this update.

This latest enhancement is a step towards making AI coding assistants more reliable and efficient for developers, promising a smarter and more dependable tool for everyday development.

Image source: Shutterstock


Source: https://blockchain.news/news/github-copilot-enhances-code-search-with-new-embedding-model

면책 조항: 본 사이트에 재게시된 글들은 공개 플랫폼에서 가져온 것으로 정보 제공 목적으로만 제공됩니다. 이는 반드시 MEXC의 견해를 반영하는 것은 아닙니다. 모든 권리는 원저자에게 있습니다. 제3자의 권리를 침해하는 콘텐츠가 있다고 판단될 경우, service@support.mexc.com으로 연락하여 삭제 요청을 해주시기 바랍니다. MEXC는 콘텐츠의 정확성, 완전성 또는 시의적절성에 대해 어떠한 보증도 하지 않으며, 제공된 정보에 기반하여 취해진 어떠한 조치에 대해서도 책임을 지지 않습니다. 본 콘텐츠는 금융, 법률 또는 기타 전문적인 조언을 구성하지 않으며, MEXC의 추천이나 보증으로 간주되어서는 안 됩니다.