Generative AI 101 Slides | GenAI Guide

Agenda - Overview, history, current state, whats coming, limitations and mitigations

GenAI powered Creatvity, Automation and Personalization.

Generative AI is reshaping industries by enabling machines to create content, code, and insights — driving innovation, automation, and personalization at scale. Businesses must understand the technology, evolving frameworks, costs, and vendor ecosystem to build effective GenAI solutions responsibly. As adoption grows, addressing challenges like bias, data privacy, and governance will be key to unlocking GenAI’s full potential.

20 Latest Update of Generative AI

Llama 3 released. Google code Assist announced

Last month update - generative AI update

Generative AI is rapidly reshaping how creativity, business, and education intersect, with significant applications emerging across sectors:

Arts & Culture: The technology is breathing new life into lost art, demonstrated by a project to restore missing scenes from a 1940s Orson Welles film.

Entertainment: AI is becoming a major force in creative production, with about 20% of new games on Steam now using it for visuals, storytelling, or interactive features.

Business & IT: Companies are adopting AI to guide strategy, manage risk, and simplify the modernization of aging mainframe systems, particularly in compliance-heavy regions like Europe.

Education: Universities like Duke are proactively testing custom AI tools to explore the benefits and navigate the risks of AI-assisted learning.

Dataknobs is leveraging generative AI to build advanced data products such as stock market signals and equipment health indices, while also creating integrated knowledge solutions that combine knowledge generation, website development, and intelligent chatbot creation.

Gen AI for IT Systems

Apply . GenAI in IT System

A CIO building an annual budget for generative AI should start by identifying high-impact, low-risk areas where AI can be applied within IT systems, such as automating service desk requests, accelerating code development, and improving system monitoring. By prioritizing these internal use cases, the CIO can create measurable cost savings and efficiency gains that justify further investment. The budget should allocate funds not only for technology adoption but also for workforce training, governance, and security, ensuring that generative AI is implemented responsibly and delivers sustainable value.

Gen AI offering/update from Vendors

Closed Model From Hyper scalers

Google, Microsoft, and AWS are all rapidly expanding their generative AI ecosystems to capture enterprise and consumer demand.

Google Gemini: Bringing multimodal AI into everyday life with advanced tools like Flash image editing and by replacing Google Assistant across devices, making AI more integrated into both personal and professional environments.
Microsoft Azure: Focusing on enterprises by combining OpenAI’s models with Azure AI Foundry, AI Search, and strong governance frameworks, enabling organizations to securely build, customize, and scale generative AI solutions.
AWS: Expanding its Bedrock platform with foundation models, the AgentCore framework for AI agents, a third-party AI tools marketplace, and vector-optimized storage—helping developers train, deploy, and scale AI workloads cost-effectively.

Together, these offerings highlight how cloud providers are not only competing but also shaping the future of AI-driven innovation across industries.

Gen AI offering fron Non Hyper Scalers

Other options OpenAI Meta and Hugging Face

OpenAI
- Released GPT-5 with stronger reasoning, coding, and multimodal capabilities, adopted at large scale.
- Introduced open-weight models (gpt-oss-120b, gpt-oss-20b) under Apache 2.0 for flexible, offline-friendly development.
Anthropic (Claude)
- Advanced the Claude family (e.g., Claude 3.7 Sonnet, Claude 4) for deeper reasoning, coding, and tool use.
- Added Claude Code for enterprises and a Claude agent for Chrome to boost productivity in the browser.
Meta Llama (via Hugging Face)
- Offers Llama 3 and Llama 4 variants (8B–402B), including multimodal and MoE-optimized options.
- Accessible on Hugging Face for customization, fine-tuning, and open-source innovation.

GenAI vs Predictive AI

Gen AI creates, Predicitve AI classify ability to create make GenAI very powerful

Generative AI focuses on creating new content—such as text, images, or code—by learning patterns from data and producing original outputs. Predictive AI, on the other hand, analyzes historical data to forecast future outcomes, trends, or behaviors without generating new content.

Large Language Model

LLMs are build by training on large text data . These are useful to generate text, summarize document, answer questions, reason, translate, do SEO of web pages

A Large Language Model (LLM) is an advanced AI system trained on vast datasets to understand, process, and generate human-like language. Beyond basic conversation, LLMs are widely applied in code generation, CSS design automation, and a range of NLP tasks such as document analysis, summarization, and entity extraction. They also enable structured data analysis, helping organizations interpret complex datasets, uncover insights, and automate workflows. With these capabilities, LLMs power applications across industries—from intelligent chatbots and virtual assistants to advanced developer tools and enterprise knowledge systems.

How to use LLMs and build applications

Consideration while selecting - Open Source vs Closed LLM Closed Model can be access via API and have a cost. Open Model have less cost, more control but may require significant work

Closed-source LLMs, like OpenAI’s GPT-5 or Anthropic’s Claude, are proprietary models where access is limited through APIs or subscriptions, offering high performance but little transparency or customization. Open-source LLMs, such as Meta’s Llama or Mistral, make their model weights publicly available, enabling developers to fine-tune, deploy, and innovate freely, often with strong community support.

LLMs building blocks are Data, Architecture, Training, Inference

While making decisions about Closed vs Open model you should consider above factors, control needed, IP risks etc

Lllama and Hugging Face are 2 famous open source model. ChatGPT, Bard, Palm , Claude are closed models.

LLM Comparision Criteria

Consideration while comparing - LLM for accuracy and size More complexity and size can be good or bad

8 factors to consider for comparing LLMs

Comparison Dimension	Description
Model Size (# of Parameters)	Number of parameters (e.g., billions/trillions) affecting complexity, quality, and compute needs.
Accuracy & Benchmark Performance	Scores on MMLU, GSM8K, HumanEval, etc., indicating reasoning, coding, and knowledge strength.
Open vs. Closed Source	Open-weight models allow full customization and self-hosting; closed models offer managed APIs and guardrails.
Cost & Efficiency	Inference and training costs, token pricing, and hardware efficiency (GPU/TPU utilization).
Latency & Speed	Response time and throughput—critical for real-time chat, copilots, and streaming UX.
Multimodality	Support for text, images, audio, and video in input/output versus text-only models.
Context Window Length	Maximum tokens the model can consider at once—important for long documents and multi-turn sessions.
Fine-Tuning & Customization	Options for fine-tuning, adapters, or parameter-efficient methods to fit domain tasks.
Governance & Safety	Content filtering, policy guardrails, auditability, and compliance features.
Ecosystem & Tool Integration	SDKs, plugins, RAG/search tools, and cloud integrations (Azure, AWS, Google Cloud, Hugging Face).

Guardrails for GenAI LLM and Chatbots

Safety for LLM and GenAI

Guardrails are essentially guidelines and controls that steer the LLM's outputs in the desired direction.

Here are some ways to keep your LLM on track:

Input Validation: Set criteria for what kind of information the LLM can process, preventing nonsensical or malicious inputs.
Output Filtering: Review and potentially edit the LLM's outputs before they are used, catching any biases or factual errors.
Real-time Monitoring: Continuously track how the LLM is being used and intervene if it generates harmful content.
Human oversight: Ensure humans are always involved in the LLM interaction

Guardrail	Practices
Clear use-case & policy scope	Define allowed/blocked intents, refusal rules, and escalation paths.
Privacy & data minimization	Strip/obfuscate PII, control retention, encrypt data in transit and at rest.
Input validation & prompt hardening	Templatize prompts, sanitize user/tool inputs, and block jailbreak patterns.
Prompt-injection defenses	Filter untrusted content (RAG docs, web pages) and isolate it from system instructions.
Grounding to trusted sources	Use RAG/knowledge graphs; require citations and confidence signals to curb hallucinations.
Safety & content filters	Toxicity, hate, self-harm, IP/PII leakage, and malware classifiers on inputs and outputs.
Bias & fairness checks	Run bias evaluations, adjust datasets/prompts, and document mitigations.
Human-in-the-loop gates	Require review for high-risk actions (code deploys, financial trades, customer emails).
Tool/agent safety	Least-privilege keys, allow/deny lists, timeouts, cost/iteration budgets, sandboxed execution.
Access control & auditability	RBAC, per-tenant isolation, comprehensive logs, and immutable audit trails.
Rate limiting & abuse prevention	Quotas, anomaly detection, circuit breakers/kill switches.
Evaluation & red-teaming	Task-specific benchmarks, adversarial testing, and pre-prod safety bars.
Monitoring & drift detection	Track quality, safety incidents, latency/cost; alert on regressions and model drift.
Change management	Version prompts/models/tools, run A/B or shadow tests, maintain rollback plans.
Compliance & provenance	Map to SOC2/ISO/GDPR/CCPA; add watermarking or provenance tags where applicable.

Foundation Model - Pretrain once, Adapt Everywhere

Act as platform. They can be use as out of box and advanced model can be trained on these.

A foundation model is a big, pre-trained AI you can quickly adapt to many jobs—chat, search, code, document analysis—without starting from scratch. Examples include OpenAI GPT-4o, Anthropic Claude 3.5, Google Gemini 1.5 Pro, Meta Llama 3 (70B), and Mistral Large. From your perspective, you plug one into your app, point it at your knowledge (via RAG or a small fine-tune), set guardrails, and immediately start asking it to draft emails, summarize PDFs, write SQL, or generate UI/CSS—while you monitor cost, latency, and quality with simple knobs.

Foundation model work out of box for universal scenarios. As FMs are exposed to internet scale data and various forms and myriad of patterns, FMs learn to apply their knowledge within a wide range of contexts. Reduce the labeling requirements

Consideration to Extend Foundation Model

Consider extending Foundation model for compettive advantage

Decision Matrix: Extend Foundation Model or Not (Domain-Agnostic)

Dimension	Stay with Base Foundation Model	Extend Foundation Model (Fine-tune / LoRA / RAG)
Accuracy & Fit	Good for general use; weaker on niche tasks.	Stronger performance in specialized domains or tasks.
Differentiation	Limited uniqueness; competitors can replicate.	Proprietary advantage and domain-specific strength.
Speed to Market	Fast to deploy.	Slower — requires data curation and training.
Data Needs	Minimal/no custom dataset needed.	Requires curated, representative domain data.
Maintainability	Easy to upgrade as base model improves.	Harder to maintain; fine-tuned models can get stale.
Compliance / Risk	Higher risk of inaccuracies in sensitive domains.	Lower risk if grounded in curated, audited data.
Cost	Lower cost (no training, cheaper inference).	Higher cost (training + possible inference overhead).
Scalability	Efficient, lightweight.	May increase latency/compute needs (depends on method).

👉 In short:

Don’t extend if you need speed, low cost, and broad/general use.
Extend if you need domain-specific accuracy, differentiation, and compliance reliability — and you have the data to support it.

Prompt Engineering

Design input in natural language to get results you want from LLM

Prompt engineering is the practice of designing and refining the inputs (prompts) given to a foundation model so that it produces reliable, accurate, and useful outputs without retraining the model. It involves carefully choosing wording, structure, and context—such as providing role instructions (“You are an expert lawyer”), formatting examples, or step-by-step reasoning cues—to guide the model’s behavior. Effective prompt engineering can significantly improve performance on specific tasks, reduce hallucinations, and tailor responses to user needs. It is often the fastest, lowest-cost way to adapt a foundation model before considering heavier approaches like retrieval augmentation or fine-tuning.

Retrieval-Augmented Generation (RAG)

Add memory to LLM and GenAI

Retrieval-Augmented Generation. It's a technique that helps large language models. Before answering your question, the LLM uses RAG to search a vast external knowledge base for relevant information.
With this extra knowledge, the LLM can provide more accurate and informative answers.
RAG combines the vast knowledge of LLMs with your data, enhancing AI's ability to provide contextually rich responses.

RAG is extremly ueful when you need to answer question on specific domain, latest and up to date information or use internal knowledge. Use RAG is
Domain-specific knowledge: If your assistant needs to be an expert in a specific domain, RAG can be used to integrate relevant databases or knowledge repositories to enhance its understanding.
Accuracy is crucial: If your LLM assistant needs to provide highly accurate information, especially on factual topics like science, history, or specific procedures, RAG can ensure responses are grounded in real-world knowledge.
Combating hallucinations: LLMs can sometimes make up information, called hallucination. RAG combats this by providing verifiable evidence to support the response.
Building trust: By allowing users to see where the information comes from (think footnotes!), RAG fosters trust and transparency in the assistant's responses.
The key advantage of RAG is flexibility. Unlike fine-tuning, where knowledge is baked into the model weights, retrieval can be updated dynamically—simply by adding or changing documents in the external source. This makes it ideal for domains with fast-changing information (like regulations, product catalogs, or research) and for scenarios where data privacy requires strict control over what the model can access.

AI Assistant Tradeoff Factors

You want better accuracy, performance and cost. However priortize what is most important

Trade Off Factors For AI Assistants

3 Dimensions

Accuracy : Lead to user Trust and Effectiveness: Higher accuracy in understanding and responding to user queries builds trust and reliability. Accurate AI assistants can effectively handle complex tasks, providing precise information and solutions.
Performance : Better Speed and Scale lear to quick response. High-performance AI assistants provide quick responses, improving user experience and efficiency. Efficient performance ensures that the AI can handle numerous requests simultaneously, essential for scaling operations.
Cost : Lead to adoption and ROI generation. Lower costs can make AI assistants accessible to a broader range of users and businesses. Affordable solutions encourage wider adoption, enabling more industries to benefit from AI.

Unleash the power of Similarity Search

Vector DB and Vector Embeddings

Vector DB store date in vector embeddings which are high dimension vectors that represent features or attributes of data. Traditional DB store dat in rows and columns. In traditional db each columen has signle field and each row is a record.

A vector database is a specialized system designed to store, index, and search high-dimensional vectors—numerical representations of data such as text, images, audio, or video. These vectors are generated by machine learning models (like embeddings from LLMs) and capture the semantic meaning of the data. Vector databases use algorithms like approximate nearest neighbor (ANN) search to quickly find the most similar vectors, enabling tasks such as semantic search, recommendation systems, document retrieval, and anomaly detection. Popular vector databases include Pinecone, Weaviate, Milvus, and FAISS.

You would use a vector database over a traditional database when the goal is to retrieve information based on meaning or similarity rather than exact matches. Traditional databases excel at structured queries (e.g., looking up rows by exact keys, ranges, or filters), but they struggle with unstructured or fuzzy data like “find documents similar to this one” or “recommend songs that feel like this track.” Vector databases shine in these scenarios because they can handle unstructured data efficiently, scale to millions or billions of items, and deliver low-latency similarity search results that power modern AI-driven applications.

Traditional DBs are good for extact match vs vector dbs are great for similarity search.
Vector Dbs have great use in many use cases like:

Text Search, Question Answer Bot, Vision search, Find similar design, Efficient analysis of audio, Find visually similar images e.g. blue water

OpenAI Fine tuning

Fine tune Open AI by your training data to handle edge cases, complex scenarios

OpenAI provides robust fine-tuning capabilities that allow developers to adapt base models to their specific needs. Fine-tuning lets organizations train models on their own datasets so the outputs align more closely with their domain, brand voice, or task requirements. Instead of building a model from scratch, developers can start with a powerful foundation model and refine it with supervised training examples. OpenAI supports structured fine-tuning workflows, including preparing datasets, training custom variants, and evaluating model performance. This enables applications like domain-specific chatbots, personalized assistants, and tailored classification or generation tasks. Additionally, OpenAI provides monitoring, evaluation, and versioning tools, so teams can iterate safely and ensure that their fine-tuned models meet performance and compliance standards.
However, before attempting fine-tuning, it’s often more efficient to try prompt engineering or few-shot learning, as these approaches can achieve strong results with minimal effort. They are faster, cheaper, and easier to iterate on, helping determine whether fine-tuning is truly necessary.

Fine tuning Steps

Fine tuning Steps- Prepare Data in same format as Completion API Create 10-100 examples of each type

The process of fine-tuning an OpenAI model generally involves several key steps. First, you need to prepare and clean your dataset, ensuring it is in the correct JSONL format with high-quality examples that reflect the desired input-output behavior. Next, you upload the dataset to OpenAI’s platform and run validation checks to confirm the data is properly structured. After that, you train the model using OpenAI’s fine-tuning API, where the base model is adapted on your custom data to learn specific patterns or domain knowledge. Once training is complete, you’ll receive a custom fine-tuned model ID, which can be used in place of the base model for API calls. The final step is to evaluate and iterate: test the model’s performance against real-world prompts, refine the dataset if needed, and re-train to continually improve results.

Tech Stack and modeling architectures

There are many architectures - Transformer Auto encoder, GAN, Diffusion model and RL

Generative AI is based on "comprehend existing" data and determine trajectories data can take. It uses it for generation. Generative AI works by comprehending existing data, identifying patterns, and mapping the possible trajectories that data can take. It then leverages these insights to generate new content—such as text, images, or solutions—that aligns with the learned structures while exploring novel variations.

Diffusion architecture is suitable for generation

Transformer are suitable for language gentration in sequence

Generative AI (GenAI) models come in various architectures, each designed to handle specific types of tasks. The most widely used are Large Language Models (LLMs) based on the transformer architecture, which uses self-attention mechanisms to understand and generate human-like text. Transformers power models such as GPT, PaLM, and LLaMA, enabling them to capture long-range dependencies and context efficiently. Beyond text, multimodal architectures like CLIP and Flamingo combine language and vision, allowing models to interpret images alongside text. Diffusion models, such as Stable Diffusion and DALL·E, are another branch of GenAI that excel at image generation, gradually transforming noise into coherent visuals guided by textual prompts. Similarly, models like Whisper specialize in speech-to-text, leveraging sequence-to-sequence learning tuned for audio inputs.

LLM architectures also vary by design goals and efficiency strategies. Decoder-only models (e.g., GPT-4, LLaMA) excel at generative tasks like writing, coding, and reasoning, while encoder-decoder models (e.g., T5, FLAN-T5) are particularly strong in translation, summarization, and instruction following. Some models use Mixture of Experts (MoE) architectures, like Google’s Switch Transformer, which activate only subsets of parameters during inference to balance scale with efficiency. Others explore retrieval-augmented generation (RAG), integrating external knowledge sources such as vector databases to extend a model’s effective memory. These variations reflect the broader trend of tailoring architectures to achieve better performance, efficiency, and adaptability across diverse generative AI applications.

How to Evaluate Gen AI

Identify what is important - Creativity, Realisim , Diversity and then evaluate

Traditional machine learning model has evaluaition metrics like accuracy. Generative AI creates new data. It evaluation is based on subjecive measures like diversity of data, realism, nobalty, creativity. It is hard to evaluate or benchmark geenrative models.

Generative AI adoption framework

Consider High Risk vs Low Risk application. Also consider generic data vs custom data dimension

Use above dimensions to identify quardant of your use case(s). Low risk and applicability of generic data e.g content writing for sales,travel guide are easy to adopt.

Areas where universal data is available but risk of geenrating wrong results are high - it is opportunity for companies that want to train and sell custom models

Areas where task specifci fine tuning is needed, is opprtunity for services companies

Companies that want to build defendable IP will focus on create new dataset and model training on these dataset for areas where risk is high and universal dataset are not available.

Trade off and Conflicts

Challenges, Ethical Issues, Copyright issues, Data ownership question, Environmental issues

Slide 17 Generative AI challenges - generative AI 101

Generative AI has potential, but there are many challenges and open questions. One should consider these before using geenrative AI in enterprise(s)

Uncontrolled data production

LLM, Image generation can produce variety but There is very less control how new data will be generated

Uncontrolled data generation challenge - generative AI

Data ownership challenges - generative AI

Generative AI is based on "comprehend existing" data and determine trajectories data can take. It uses it for creating new data. However the method of producing new variation of data makes it controllable.

Generative Model output is unpredicatble and uncontrollable. Main issue is - how to get confidence if you want to use it in mission critical envioronment.

Large language Model (LLM) inherit bias from data they are trained on.

There are open questions on who have copy right on generated content. In future there may be new laws that will impact consumers of generative AI

LLM or Image model that are trained on universal data and produce new data are compute intensive. There are impact of enviornment one need to consider.

Enviornment concern and legal question

One need to responsibly use Generative AI

There is significant amount of compute, energy usage. There is significant amount of carbon emission. One need to ensure it is for good cause and not for producing variation(s)

Dataknobs has created set of controls to handle this

Function Calling

How To Benefit From Function Calling

OpenAI function calling is a powerful feature that enables seamless integration of AI models with structured tasks, APIs, or external systems. By defining specific functions and providing their schemas, developers can guide the AI to trigger those functions only when appropriate, enhancing the model's capability to interact with external data and services. This approach allows for greater control and precision, enabling use cases such as querying databases, fetching live information, executing computations, or even managing workflows. Function calling ensures that responses remain contextually relevant and action-oriented, bridging the gap between conversational AI and real-world applications while maintaining flexibility and scalability in AI-driven solutions.

GenAI Project Management

Framework Manage GenAI. Programs

GENAI PROGRAM MANAGEMENT FRAMEWORK

4 AREAS

THE GENAI PROJECT MANAGEMENT FRAMEWORK INVOLVES IDENTIFYING OPPORTUNITIES FOR AI INTEGRATION, DEVELOPING A PROOF OF CONCEPT, AND PROGRESSING THROUGH STAGES OF MATURITY INCLUDING PROTOTYPING, PILOTING, SCALING, AND OPTIMIZING. A ROBUST GOVERNANCE STRUCTURE ENSURES SUCCESSFUL IMPLEMENTATION THROUGH RIGOROUS TESTING, COMPLIANCE, CONTINUOUS MONITORING, AND STAKEHOLDER ENGAGEMENT.

GenAI Budget Planning

Planning Budgeting. For CIO

GENAI - How CIOs should allocate Budget

Identify Use Cases

Understand Tech stack

Understanding Costing - Training, Inference, APP

Understand how cost change with scale and prod rollout

Divide budget - People, Data, Development, API, Off the shelf, Licensing

-->

Governace for GenAI LLM and Chatbots | governance Framework for GenAI

Trustyworthy and reliable LLM and GenAI

We recognize the immense potential of LLMs to revolutionize various aspects of our lives. However, we also acknowledge the critical need to ensure their development and deployment are guided by ethical principles and safeguard human values. Above are guiding principles and framework for AI. It is further extended for GenAI. Click to see detail slides for personalziation, automation and creative scenario to specific governance items

Security Framework - For Gen AI

Identify attack Surface & Define Action Plan For Security

Establish AI Governance across enterprise. Have action plan to secure data, infrastructure and Model. See more details by clicking on slides

GPT 4o vs GPT 4 Turbo vs GPT 3.5

GPT4 has one trillon parameters. It has longer context, multimodal capabilities. It has lower rate of hallunication.

GPT 4o is latest model released by Open AI. It accept text and image as input and produce text output. It can reason across vision, text and audio

Use GPT 3.5 Turbo for ordinary task. It is cheapest. Use GPT 4o for complex task that need high quality

DALLE, TTS, Whisper are model for images, sound

GPT (Generative Pre-Trained Transformer) is family of large language model. developed by OPEN AI. GPT4 is latest generaral purpose LLM released by OpenAI. ChatGPT4 is chatbot focused LLM.

ChatGPT4 has large token length compared to GPT3.5 ChatGPT4 can process 25000 words of context. It is 8 times higher than chatGPT3.5

ChatGPT4 can understand and process visual input.

ChatGPT 4 has better programming capabilities compared to ChatGPT 3.5

ChatGPT 4 has fewer hallunication compared to ChatGPT 3.5

Digital Human

Digital Human is type of generative AI to handle complete interaction with customers

A Digital Human is an AI-powered virtual being that interacts with people naturally, combining advanced speech, vision, and emotional intelligence to simulate human-like conversations. These AI-driven personas can be used in customer service, education, healthcare, and entertainment, providing personalized and engaging interactions. With realistic facial expressions, voice modulation, and contextual awareness, digital humans bridge the gap between technology and human communication, making digital experiences more intuitive and lifelike. As AI and deep learning continue to evolve, digital humans are becoming an integral part of business and everyday life, enhancing engagement and efficiency in human-computer interactions.

Virtual Agent to Digital Human

Use this framework to determine what kind of capabilities generative AI should add

Framework for AI Human conversation generative AI 101

Virtual Agent/Digital Agent are for one off task.

Virtual assistant carry context and are ongoing engagement

Digital influencer add experience and emotion into interaction

Digtial Human provide experience/emotion for ongoing engagement.

AI Assistant vs Digital Human vs Robots

Digital Human has digital apperances Robots exist in physical world and have mobility

AI Assistant vs Digital Human vs Robot

Similarties and differences

AI Assistant do not have appearance. They are well suited for tasks. These interact thru text and voice. Digital Human has physical appearance but they exist in digital world. They use non verbal cues such as facial expression in addition to text and voice. Robot exist in physical world. They have mobility and can interact with real world enviornment to move things. Digital human can only do digial task such as give information, schedule meeting. Robot on the other hand can do task in physical enviornment.

GenAI for Technology Domain

Improve Search , Cyber Security and Audience Intelligence

Understand user intent better and improve search with embeddings. With Generative AI - do persona development and create personalized responsed. With GenAI do Proactive threat detection. Generative AI can also simulate cyber attack and help prepare model and advance technqies to handle such attacks.

Generative AI Applications in Security

As GenAI generate new data, it is extremly useful to create synthenic attacks and test security solutions

Generative AI is extremly useful for Cyber Security

Simulate phishing attack to check robustmess of security solution

Automate the analysis of security logs

Generative AI Applications in Payments Industry

Payment industry need reliability. GenAI can be used for imporving compliance, audit and fraud detection.

Payments is highly regulated area

Customer Service can be made effective in Payment industry

Audit can be made more robust in payment industry

Check validation, Fraud detection can be improved

Gen AI for Mortgage Industry

Loadn Origination Customer Service

GenAI is useful for Mortgage Industry

It can do document analysis in loan origination process and streamline it. It can help in customer service. Most important it can automate compliance in mortgage industry

Gen AI for Mobile Development

Creativity App Development

GenAI can revolutionze mobile app development.

New apps willemerge for creativity. Better personalization will be provided in future apps.

Generative AI Vendors

Open AI, Microsoft, GCP, AWS, Anthropic ..

Vendors: OpenAI, Microsoft, GCP, AWS, Anthropic, Dataknobs, Snorkel and more

Evaluation criteria : Features, accuracy, flexibility, ability to fine tune model, cost of inference, how reliable reults are

li>

Open AI and Micorosoft model is most used

li>

GCP bard provide up to date information. GCP also has TPU

AWS has large cloud share

Anthropic released caluse with 100K tokens

Hugging Face provide many smaller models

ChatGPT3.5 ChatGPT4 and Bard

GPT4 has good reasoning. ..

Open AI has chatGPT3.5 and chatGPT4.0

Microsoft Azure provide OPEN AI services integrated with Azure

Google has Bard, Vertex AI with generative AI studio. In addition google has TPUs

AWS provide cloud to use existing capabilities

Companies like Hugging Face, Anthropic are providing their own model

Generative AI capability from Hugging Face

Generative AI capability from Anthropic and comparision

AWS bedrock enable using Hugging face, anthropic or other model on AWS cloud

Hugging Face is providing various small model like BERT, GPT-3, ROBERTA, XLNET

Anthropic has build Claude. Available to use at "poe . com". There are 3 flavors even with 100K tokens.

Build Data Products With Agentic AI

Create Automate and Scale

Agent AI is a powerful, autonomous assistant designed to streamline tasks, provide real-time insights, and enhance decision-making across industries. Whether you're managing customer interactions, analyzing data, or automating workflows, Agent AI adapts to your needs using advanced AI models and contextual understanding. With seamless integrations, continuous learning, and personalized recommendations, Agent AI empowers businesses to operate more efficiently while delivering exceptional user experiences.

build Data Products using GenAI

Data Products build Right From Start

Dataknobs capablities - KREATE, KONTROLS and KNOBS.
KREATE focus on creatibvity and generation
KONTROLS provide guardrails, lineage, compliance, privacy and security.
KNOBS enable experimentation and diagnosis