Margaret Rouse is an award-winning technical writer and teacher known for her ability to explain complex technical subjects simply to a non-technical, business audience. Over…
Natalie is an editor specializing in educational content, with a deep passion for technology and cryptocurrency. Her expertise lies in transforming complex tech and crypto…
Windows AI Studio is a software development environment that allows developers to build generative AI apps and deploy them locally on digital devices that run Windows 11.
Microsoft officially announced Windows AI Studio in October 2023 during its annual Microsoft Ignite conference.
The release of Windows AI Studio marks a significant step in Microsoft’s efforts to democratize generative AI and make it accessible to a broader range of developers and users.
It also highlights the growing need for generative AI to function on edge devices and in regions with limited, unreliable, or no internet connectivity.
Windows AI Studio has a simple workflow that allows developers to build, train, and deploy small language models (SLMs) without having to use cloud infrastructure.
Developers access Windows AI Studio through Visual Studio Code, a lightweight open-source code editor developed by Microsoft.
Microsoft’s strategy is to allow developers to take advantage of their familiarity with Visual Studio Code’s editing environment while benefiting from the generative AI capabilities of Azure AI Studio. This approach aims to make generative AI more accessible and less intimidating for developers with varying skill levels.
The first thing the developer will be asked to do is select a pre-trained small language model from a curated list provided by Hugging Face and Azure. This is an important step that new developers should plan on spending time on.
Many of the pre-trained SLMs provided by HuggingFace and Azure are designed for discrete tasks, such as text classification, sentiment analysis, machine translation, or question answering.
Whatever pre-trained model the developer chooses, it should be designed for the task(s) the developer is building their Windows app to carry out, and the model should be able to function well on battery-powered computing devices that have limited power, storage, and processing resources.
The next thing the developers will be asked to do is use their own data to fine-tune the pre-trained model they selected.
This process involves adjusting the model’s parameters to optimize it for the application the developer is building. Although it may seem as if this step should take a lot of time, it doesn’t.
The Windows AI Studio user interface (UI) makes it easy to adjust parameters by providing the developer with sliders, buttons, and other low code/no code (LCNC) dashboard elements.
On the back end, behind the scenes, the developer is actually using an integrated development environment (IDE) called Olive and a technique called QLoRA (Quantized Low-Rank Adapters) to adjust parameters.
Once the model has been fine-tuned and optimized, it is converted from its native format into the Open Neural Network Exchange (ONNX) format so the model and its dependencies can be used by the developer’s Windows application. The application will use ORT (Onnx Runtime) to execute the model during inference.
Now that generative AI has become cheaper and easier to use, people are discovering new uses for the technology.
Researchers have been exploring techniques like model compression, quantization, and knowledge distillation to reduce the computational footprint of large language models (LLMs), but they are still not suitable for direct deployment on personal computers, tablets, mobile phones, kiosks, and other types of Internet of Things (IoT) edge devices.
Until now, on-device AI has been a challenge. Microsoft’s investment in Windows AI Studio is expected to be a key driver for running small language models locally and creating new use cases for on-device generative AI.
If Microsoft is successful, developers will be able to:
Techopedia’s editorial policy is centered on delivering thoroughly researched, accurate, and unbiased content. We uphold strict sourcing standards, and each page undergoes diligent review by our team of top technology experts and seasoned editors. This process ensures the integrity, relevance, and value of our content for our readers.
Margaret is an award-winning technical writer and teacher known for her ability to explain complex technical subjects to a non-technical business audience. Over the past twenty years, her IT definitions have been published by Que in an encyclopedia of technology terms and cited in articles by the New York Times, Time Magazine, USA Today, ZDNet, PC Magazine, and Discovery Magazine. She joined Techopedia in 2011. Margaret's idea of a fun day is helping IT and business professionals learn to speak each other’s highly specialized languages.
What is Turnitin AI Checker? The Turnitin AI checker is an advanced tool aimed at maintaining the integrity of school...
Maria WebbTechnology journalist
What is ISO/IEC 42001? ISO/IEC 42001 is an international standard that provides a governance framework for implementing and continually improving...
Margaret RouseTechnology Expert
What are Physical Resource Networks (PRNs)? The definition of Physical Resource Networks (PRNs) is that they are a type of...
Nicole WillingTechnology Journalist
Trending NewsLatest GuidesReviewsTerm of the Day