Generative AI Modeling Architectures Slides | Gen AI Models




Generative AI Modeling Architectures


  • An autoencoder takes high-dimensional data as input and produces a compressed representation of it.

  • A GAN architecture has a generator and a discriminator component. The generator creates new data, and the discriminator distinguishes between real data and generated data.

  • LSTM and attention models are sequence-to-sequence (Seq2Seq) models built on an encoder-decoder structure. The attention mechanism helps the model focus on the parts of the input that matter. The transformer is an extension of the attention model that also encodes token position.

  • Diffusion models are used mainly for images. They are based on the principle of how a substance spreads through a medium.

    GAN Architecture


    A GAN model architecture is a combination of two neural networks that are trained in an adversarial manner. The first neural network, the generator, is responsible for creating new data. The second neural network, the discriminator, is responsible for distinguishing between real data and data created by the generator. The two networks are trained simultaneously, with the generator trying to fool the discriminator and the discriminator trying to correctly identify real and fake data.
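
    The adversarial objective described above can be sketched with plain binary cross-entropy. This is a minimal illustration, not a training loop: the probability lists stand in for the outputs of a hypothetical discriminator, and the non-saturating form of the generator loss is one common choice that is assumed here.

```python
import math

EPS = 1e-12  # keeps log() away from zero


def discriminator_loss(d_real, d_fake):
    """Binary cross-entropy: real samples should score near 1, fakes near 0."""
    real_term = -sum(math.log(p + EPS) for p in d_real) / len(d_real)
    fake_term = -sum(math.log(1.0 - p + EPS) for p in d_fake) / len(d_fake)
    return real_term + fake_term


def generator_loss(d_fake):
    """Non-saturating generator loss: low when fakes are scored as real."""
    return -sum(math.log(p + EPS) for p in d_fake) / len(d_fake)


# Hypothetical discriminator outputs (probability that a sample is real).
confident = discriminator_loss(d_real=[0.9, 0.8], d_fake=[0.1, 0.2])
fooled = discriminator_loss(d_real=[0.9, 0.8], d_fake=[0.9, 0.8])
```

    The discriminator's loss rises when the generator fools it, while the generator's loss falls in the same situation, which is what makes the training adversarial.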


    BigGAN


    BigGAN is a class-conditional generative adversarial network (GAN) that improves the quality of generated images mainly by scaling up training. It does not use progressive growing, which is the technique introduced by Progressive GAN. Instead, BigGAN trains at full resolution with much larger batch sizes and wider networks than earlier GANs, and it applies orthogonal regularization to the generator so that a truncation trick at sampling time can trade sample variety for fidelity. These changes allow BigGAN to generate more realistic images than earlier GANs.


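    StyleGAN


    StyleGAN is a generative adversarial network (GAN) that uses a style-based generator architecture to generate high-quality images. A mapping network transforms the latent code into an intermediate style vector, and that style vector controls each layer of the synthesis network through adaptive instance normalization (AdaIN). Because styles are injected at every resolution, StyleGAN can control both coarse and fine attributes and generate images with a high level of detail and realism.


    StyleGAN2 addresses the shortcomings of StyleGAN, such as its characteristic artifacts. It uses weight demodulation instead of AdaIN, and it replaces progressive growing with skip connections in the generator and residual connections in the discriminator.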
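
    Weight demodulation can be sketched in a few lines. The style first scales the convolution weights per input channel, and each output filter is then rescaled to unit L2 norm. The nested-list layout `weights[out][in][tap]` and the function name are assumptions made for illustration; a real implementation operates on tensors and applies a separate style per sample.

```python
import math


def modulate_demodulate(weights, styles, eps=1e-8):
    """weights[out][in][tap]; styles[in] is a per-input-channel scale."""
    result = []
    for out_filter in weights:
        # Modulate: scale every tap of input channel i by styles[i].
        scaled = [[s * w for w in taps] for s, taps in zip(styles, out_filter)]
        # Demodulate: divide the whole output filter by its L2 norm.
        norm = math.sqrt(sum(w * w for taps in scaled for w in taps) + eps)
        result.append([[w / norm for w in taps] for taps in scaled])
    return result


# Hypothetical 2-output, 2-input layer with 2 taps per kernel.
weights = [[[1.0, 2.0], [3.0, 4.0]], [[0.5, 0.5], [0.5, 0.5]]]
demodulated = modulate_demodulate(weights, styles=[2.0, 0.5])
```

    After this step every output filter has approximately unit norm regardless of the style values, which is the normalization effect that AdaIN previously provided on the activations.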


    VQ-GAN


    VQ-GAN is a generative adversarial network (GAN) that uses vector quantization (VQ) to improve the quality of generated images. VQ is a technique for representing data as a discrete set of symbols. In VQ-GAN, the model learns a codebook of vectors, and the encoder replaces each local image feature with its nearest codebook entry. An image therefore becomes a grid of codebook indices, which a sequence model such as a transformer can learn to generate, while the adversarially trained decoder turns those codes back into detailed images.
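
    The quantization step itself is a nearest-neighbour lookup. The sketch below assumes a tiny hand-written codebook and made-up encoder outputs; in a trained model both are learned.

```python
def quantize(vector, codebook):
    """Return (index, entry) of the codebook vector nearest to `vector`."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    index = min(range(len(codebook)), key=lambda i: sq_dist(vector, codebook[i]))
    return index, codebook[index]


# Hypothetical codebook with three 2-dimensional entries.
codebook = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
# Hypothetical continuous encoder outputs, one per image patch.
encoder_outputs = [[0.1, -0.2], [0.9, 0.8], [0.2, 0.9]]

codes = [quantize(v, codebook)[0] for v in encoder_outputs]
```

    Each continuous vector is replaced by the index of its closest codebook entry, so the image is stored as a short sequence of integers.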


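    Variational Autoencoder


    A variational autoencoder (VAE) is a generative model that learns to represent data by encoding it into a latent space. The latent space is a lower-dimensional space that captures the essential features of the data. Rather than mapping each input to a single point, the encoder outputs the parameters of a distribution over the latent space, typically the mean and variance of a Gaussian. The VAE can then generate new data by sampling from the latent space and decoding the sample back to the original space.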
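
    Two pieces of a typical VAE are easy to show in isolation: sampling a latent vector with the reparameterization trick, and the KL term that pulls the encoder's Gaussian toward a standard normal prior. The sketch assumes a diagonal Gaussian encoder that outputs a mean and a log-variance per dimension; the function names are illustrative.

```python
import math
import random


def sample_latent(mu, log_var, rng):
    """Reparameterization: z = mu + sigma * eps, with eps ~ N(0, 1)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, sigma^2) || N(0, 1)), summed over latent dimensions."""
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0
                     for m, lv in zip(mu, log_var))


rng = random.Random(0)  # seeded for repeatability
z = sample_latent(mu=[0.5, -1.0], log_var=[0.0, -2.0], rng=rng)
kl = kl_to_standard_normal(mu=[0.5, -1.0], log_var=[0.0, -2.0])
```

    The KL term is zero only when the encoder output already matches the prior, which is what keeps the latent space suitable for sampling new data.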


    Conditional Variational Autoencoder


    A conditional variational autoencoder (CVAE) is a generative model that takes an additional input, called the condition, and generates data that is conditioned on that input. This is in contrast to a plain variational autoencoder (VAE), which takes no additional input and therefore offers no direct control over what it generates. Because of this, a CVAE can be used to generate data that is specific to a particular condition. For example, a CVAE could be used to generate images of monkeys that are all wearing pajamas, or to generate text or CSS with a particular formatting style.
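
    One common way to supply the condition is to concatenate an encoding of it to the latent sample before decoding. The sketch below assumes a categorical condition encoded as a one-hot vector; the function names and the four-class example are hypothetical.

```python
def one_hot(label, num_classes):
    """Encode a class index as a one-hot list of floats."""
    return [1.0 if i == label else 0.0 for i in range(num_classes)]


def decoder_input(z, label, num_classes):
    """Concatenate the latent sample and the condition for the decoder."""
    return list(z) + one_hot(label, num_classes)


# Hypothetical latent sample and condition (class 2 of 4).
x_in = decoder_input(z=[0.3, -1.2], label=2, num_classes=4)
```

    Keeping the latent sample fixed and changing only the label asks the decoder for the same underlying variation rendered under a different condition.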


    Attention and Transformer


    An attention model is a neural network that learns to focus on specific parts of an input sequence. This is done by computing a weighted sum of the input sequence, where the weights are determined by the attention mechanism. The weighted sum is then used to generate the output sequence.
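
    The weighted sum can be written out directly. The sketch below uses scaled dot-product scoring, which is one common way to compute attention weights and is assumed here; the query, keys, and values are made-up numbers.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    context = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
    return weights, context


weights, context = attend(
    query=[1.0, 0.0],
    keys=[[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]],
    values=[[10.0, 0.0], [0.0, 10.0], [5.0, 5.0]],
)
```

    The key most aligned with the query receives the largest weight, so its value dominates the resulting context vector.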


    From Attention to Transformer


    The transformer is a neural network architecture that relies on attention mechanisms, rather than recurrence, to learn long-range dependencies in the input sequence. Attention lets every position in the sequence focus directly on the parts of the input that are relevant to it, which is important for tasks such as machine translation and text summarization.
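
    Attention by itself ignores the order of the tokens, so transformers add position information to the token embeddings. The sketch below assumes the fixed sinusoidal scheme from the original Transformer design; many later models learn their position embeddings instead.

```python
import math


def positional_encoding(position, d_model):
    """Sinusoidal encoding: sine on even indices, cosine on odd indices."""
    encoding = []
    for i in range(d_model):
        # Each pair of dimensions (2k, 2k + 1) shares one frequency.
        angle = position / (10000 ** ((2 * (i // 2)) / d_model))
        encoding.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return encoding


first = positional_encoding(position=0, d_model=4)
second = positional_encoding(position=1, d_model=4)
```

    Each position gets a distinct vector, and adding it to the token embedding lets the attention layers tell identical tokens at different positions apart.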


    Diffusion model


    Diffusion models are a type of generative model that adds noise to data gradually and then learns to reverse the process to generate new data. Diffusion models are often used for image generation, but they can also be used for other types of data, such as text and audio.
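
    The noise-adding half of the process has a closed form, so a noisy sample at any step can be drawn directly from the clean data. The sketch below assumes a linear noise schedule from 1e-4 to 0.02 over 1000 steps, a commonly used default; the function names are illustrative, and the learned reverse (denoising) network is not shown.

```python
import math
import random


def linear_betas(num_steps, beta_start=1e-4, beta_end=0.02):
    """Per-step noise variances, increasing linearly (assumed schedule)."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + step * t for t in range(num_steps)]


def alpha_bars(betas):
    """Cumulative product of (1 - beta_t): share of signal variance kept."""
    out, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        out.append(running)
    return out


def add_noise(x0, t, abar, rng):
    """Sample x_t from x_0: sqrt(abar_t) * x0 + sqrt(1 - abar_t) * eps."""
    signal = math.sqrt(abar[t])
    noise = math.sqrt(1.0 - abar[t])
    return [signal * x + noise * rng.gauss(0.0, 1.0) for x in x0]


abar = alpha_bars(linear_betas(1000))
rng = random.Random(0)  # seeded for repeatability
x_noisy = add_noise([1.0, -1.0, 0.5], t=999, abar=abar, rng=rng)
```

    Early steps keep almost all of the signal, while by the final step the sample is close to pure Gaussian noise. Generation runs the learned reverse process starting from such noise.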

