Spaces:
Running
Running
| <html> | |
| <head> | |
| <meta charset="utf-8"> | |
| <meta name="description" | |
| content="LLaVA: Large Language and Vision Assistant"> | |
| <meta name="keywords" content="speech-language, multi-modal, LLM, LLaVA"> | |
| <meta name="viewport" content="width=device-width, initial-scale=1"> | |
| <title>LLaVA: Large Language and Vision Assistant</title> | |
| <link href="https://fonts.googleapis.com/css?family=Google+Sans|Noto+Sans|Castoro" | |
| rel="stylesheet"> | |
| <link rel="stylesheet" href="./static/css/bulma.min.css"> | |
| <link rel="stylesheet" href="./static/css/bulma-carousel.min.css"> | |
| <link rel="stylesheet" href="./static/css/bulma-slider.min.css"> | |
| <link rel="stylesheet" href="./static/css/fontawesome.all.min.css"> | |
| <link rel="stylesheet" | |
| href="https://cdn.jsdelivr.net/gh/jpswalsh/academicons@1/css/academicons.min.css"> | |
| <link rel="stylesheet" href="./static/css/index.css"> | |
| <link rel="icon" href="./static/images/favicon.svg"> | |
| <script src="https://ajax.googleapis.com/ajax/libs/jquery/3.5.1/jquery.min.js"></script> | |
| <script defer src="./static/js/fontawesome.all.min.js"></script> | |
| <script src="./static/js/bulma-carousel.min.js"></script> | |
| <script src="./static/js/bulma-slider.min.js"></script> | |
| <script src="./static/js/index.js"></script> | |
| </head> | |
| <body> | |
| <nav class="navbar" role="navigation" aria-label="main navigation"> | |
| <div class="navbar-brand"> | |
| <a role="button" class="navbar-burger" aria-label="menu" aria-expanded="false"> | |
| <span aria-hidden="true"></span> | |
| <span aria-hidden="true"></span> | |
| <span aria-hidden="true"></span> | |
| </a> | |
| </div> | |
| <div class="navbar-menu"> | |
| <div class="navbar-start" style="flex-grow: 1; justify-content: center;"> | |
| <a class="navbar-item" href="https://keunhong.com"> | |
| <span class="icon"> | |
| <i class="fas fa-home"></i> | |
| </span> | |
| </a> | |
| <div class="navbar-item has-dropdown is-hoverable"> | |
| <a class="navbar-link"> | |
| More Research | |
| </a> | |
| <div class="navbar-dropdown"> | |
| <a class="navbar-item" href="https://huggingface.co/spaces/LinkSoul/LLaSM" target="_blank"> | |
| LLaSM | |
| </a> | |
| <a class="navbar-item" href="https://huggingface.co/LinkSoul/Chinese-Llama-2-7b" target="_blank"> | |
| Chinese-Llama-2-7B | |
| </a> | |
| </div> | |
| </div> | |
| </div> | |
| </div> | |
| </nav> | |
| <section class="hero"> | |
| <div class="hero-body"> | |
| <div class="container is-max-desktop"> | |
| <div class="columns is-centered"> | |
| <div class="column has-text-centered"> | |
| <h1 class="title is-1 publication-title">Chinese-LLaVA</h1> | |
| <div class="is-size-5 publication-authors"> | |
| <span class="author-block" style="color:#008AD7;font-weight:normal;"> | |
| Yu Shu<sup>2</sup>,</span> | |
| <span class="author-block" style="color:#008AD7;font-weight:normal;"> | |
| Siwei Dong<sup>2</sup>,</span> | |
| <span class="author-block" style="color:#cc00d7;font-weight:normal;"> | |
| Wenhao Huang<sup>3</sup>,</span> | |
| <span class="author-block" style="color:#008AD7;font-weight:normal;"> | |
| Jialei Wang<sup>2</sup>,</span> | |
| <span class="author-block" style="color:#f68946;font-weight:normal;"> | |
| Yemin Shi<sup>1*</sup> | |
| </span> | |
| </div> | |
| <div class="is-size-5 publication-authors"> | |
| <span class="author-block" style="color:#f68946;font-weight:normal;"><sup>1</sup>LinkSoul.AI,</span> | |
| <span class="author-block" style="color:#008AD7;font-weight:normal;"><sup>2</sup>Beijing Academy of Artificial Intelligence, China,</span> | |
| <span class="author-block" style="color:#cc00d7;font-weight:normal;"><sup>3</sup>01.ai</span> | |
| <!-- <span class="author-block" style="color:#ed2f09;font-weight:normal;"><sup>3</sup>Peking University, China</span> --> | |
| </div> | |
| <div> | |
| <span class="author-block"><sup>*</sup>Corresponding author: [email protected]</span> | |
| </div> | |
| <div class="column has-text-centered"> | |
| <div class="publication-links"> | |
| <!-- Model Link. --> | |
| <span class="link-block"> | |
| <a href="https://huggingface.co/LinkSoul/Chinese-LLaVA-Cllama2" target="_blank" | |
| class="external-link button is-normal is-rounded is-dark"> | |
| <span class="icon"> | |
| <i class="fas fa-atom"></i> | |
| </span> | |
| <span>Model</span> | |
| </a> | |
| </span> | |
| <!-- Code Link. --> | |
| <span class="link-block"> | |
| <a href="https://github.com/LinkSoul-AI/Chinese-LLaVA" target="_blank" | |
| class="external-link button is-normal is-rounded is-dark"> | |
| <span class="icon"> | |
| <i class="fab fa-github"></i> | |
| </span> | |
| <span>Code</span> | |
| </a> | |
| </span> | |
| <!-- Dataset Link. --> | |
| <span class="link-block"> | |
| <a href="https://huggingface.co/datasets/LinkSoul/Chinese-LLaVA-Vision-Instructions" target="_blank" | |
| class="external-link button is-normal is-rounded is-dark"> | |
| <span class="icon"> | |
| <i class="far fa-images"></i> | |
| </span> | |
| <span>Data</span> | |
| </a> | |
| </div> | |
| </div> | |
| </div> | |
| </div> | |
| </div> | |
| </div> | |
| </section> | |
| <section class="section"> | |
| <div class="container is-max-desktop"> | |
| <!-- Abstract. --> | |
| <div class="columns is-centered has-text-centered"> | |
| <div class="column is-four-fifths"> | |
| <h2 class="title is-3">Abstract</h2> | |
| <div class="content has-text-justified"> | |
| <p> | |
| To contribute to the Chinese open source community, we adapt <a href="https://llava-vl.github.io/" target="_blank">LLaVA</a> to support visual instruction following in Chinese. | |
| </p> | |
| <p> | |
| Our model makes the following contributions: | |
| </p> | |
| <ui> | |
| <li> | |
| We adapt LLaVA to support visual instruction following in Chinese. | |
| </li> | |
| <li> | |
| We release a Chinese and English visual instruction following dataset <a href="https://huggingface.co/datasets/LinkSoul/Chinese-LLaVA-Vision-Instructions" target="_blank">Chinese-LLaVA-Vision-Instructions</a>. | |
| </li> | |
| <li> | |
| We release the code in <a href="https://github.com/LinkSoul-AI/Chinese-LLaVA" target="_blank">https://github.com/LinkSoul-AI/Chinese-LLaVA.</a> | |
| </li> | |
| <li> | |
| We release the models in <a href="https://huggingface.co/LinkSoul/Chinese-LLaVA-Cllama2" target="_blank">Chinese-LLaVA-Chinese-Llama-2-7B</a> and <a href="https://huggingface.co/LinkSoul/Chinese-LLaVA-Baichuan" target="_blank">Chinese-LLaVA-Baichuan-7B</a> | |
| </li> | |
| </ui> | |
| </div> | |
| </div> | |
| </div> | |
| <!--/ Abstract. --> | |
| </div> | |
| </section> | |
| <iframe src="https://demo.linksoul.ai/vlm/" style="width:100%; height: 100%; border: 0; position: absolute; margin-top: 100px; margin-bottom: 50px;"></iframe> | |
| <!-- <section class="section" id="Acknowledgement" style="position:fixed; bottom:0px; width:100%; display:none;"> | |
| <div class="container is-max-desktop"> | |
| <div class="columns is-centered"> | |
| <div class="column is-four-fifths"> | |
| <h2 class="title">Acknowledgement</h2> | |
| <p> | |
| This website is adapted from <a href="https://github.com/nerfies/nerfies.github.io" target="_blank">Nerfies</a>, licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/" target="_blank">Creative | |
| Commons Attribution-ShareAlike 4.0 International License</a>. We thank the open-source projects for giving us access to their models, including <a href="https://huggingface.co/LinkSoul/Chinese-Llama-2-7b" target="_blank">Chinese-Llama-2-7B</a> and <a href="https://llava-vl.github.io/" target="_blank">LLaVA</a> and <a href="https://huggingface.co/baichuan-inc/Baichuan-7B" target="_blank">Baichuan-7B</a>. | |
| </p> | |
| </div> | |
| </div> | |
| </div> | |
| </section> --> | |
| <script> | |
| $(window).scroll(function(){ | |
| var scrollTop = $(this).scrollTop(); | |
| var scrollHeight = $(document).height(); | |
| var windowHeight = $(this).height(); | |
| if(scrollTop + windowHeight == scrollHeight){ | |
| $('#Acknowledgement').show(); | |
| }else{ | |
| $('#Acknowledgement').hide(); | |
| } | |
| }); | |
| </script> | |
| </body> | |
| </html> | |