CAREERS

Explore job openings across our companies
\n \n

\n\n\n \n \n \n\n \n\n\n
\n
\n
\n
\n
\n \"BOSS\n \n
\n
\n\n
\n
\n
\n
\n

Student Initiative - LLMaaS Data Governance

\n
    \n
  • \n \n Nashville, TN\n
  • \n
  • \nCategory: Interns
  • \n
  • \nType: Intern
  • \n
  • \nMin. Experience: Intern
  • \n
\n
\n
\n
\n
\n\n
\n
\n
\n\n
<div class="row">\n  <article class="col-xs-12 col-md-7">\n    <div class="job-descr content">\n      <p><b style="font-weight:normal;" id="docs-internal-guid-4c9fb2cf-7fff-ffbd-5fa9-1d14348437ac"></b></p>\n
\n

\n1.1      \tInitiative Summary:
\n

\n

Businesses looking to operationalize LLM-supported applications will benefit from using cloud-based (private or public) LLM “as a service” (LLMaaS) platforms for governance and scalability. Among many features, data governance (primarily for unstructured text) will be a critical offering of these platforms, including that from Blattner Technologies. This initiative will focus on contributing to the development of an extensible end-to-end data governance framework, including external data ingestion, parallelized data preparation and analytics, and versioning.

\n

 

\n


1.2      \tDesired Outcomes

\n

-   \tPrototype innovative workflow-based capabilities for preparing unstructured text in a scalable, traceable, and intuitive manner for downstream LLM-related tasks, such as training and fine-tuning.

\n

-   \tPresentation to broader company highlighting approach, challenges, solutions, and significant insights stemming from the effort.

\n

 

\n

1.3      \tCore Skills Required

\n

-   \tRequired skills:

\n

o   Fundamental LLM knowledge (e.g., prompt engineering, fine-tuning)

\n

o   NLP-based development (e.g., tokenization, embedding generation, and operations, textfication)

\n

o   Python development

\n

o   Experience with parallel distributed systems and/or parallel computation libraries such as Spark, Dask, or RAPIDS

\n

-   \tOptional/preferable skills:

\n

o   Kubeflow

\n

o   Vector databases

\n

o   Experience with NLP libraries such as spaCy and gensim

\n

 

\n


1.4      \tEstimated Effort

\n

-   \tFull-time summer internship (40 hours/week)

\n

-   \tDepending on progress, work may extend to part-time during the Fall semester (e.g., 10 hours/week)

\n

 

\n


1.5      \tAdditional Information

\n

This is a remote internship opportunity, working with summer mentors and reporting to the Chief Product Officer of BOSS AI. The group has a deep focus on implementing LLMs “as a service” (LLMaaS) and team members have a range of skills from enterprise software engineering, NLP, ML, and UX. You can expect to gain valuable experience in operationalizing LLMs and addressing critical security needs for all language models.

\n

\n
\n \n \n\n \n
</div>\n
\n
\n\n
\n
    \n
  • \n \n

    \n Should be {http://www.linkedin.com/pub/[member-name/]x/y/z}\n or {http://www.linkedin.com/in/string}\n

    \n \n
  • \n
\n
\n\n
\n\n
\n
\n
© 2014-2023 TriNet Group, Inc. All rights reserved.
\n Privacy Policy\n
\n\n
\n","datePosted":"2023-11-17T04:56:41.471Z","employmentType":[],"hiringOrganization":{"@type":"Organization","name":"superwise.ai","sameAs":"https://superwise.ai","logo":"https://cdn.filestackcontent.com/Ttam5s8dRjyBOUPy03Sg"},"jobLocationType":"TELECOMMUTE","jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"United States"}},"applicantLocationRequirements":{"@type":"Country","name":"Earth"}}

Student Initiative - LLMaaS Data Governance

superwise.ai

superwise.ai

United States · Remote
Posted on Friday, November 17, 2023

1.1      Initiative Summary:

Businesses looking to operationalize LLM-supported applications will benefit from using cloud-based (private or public) LLM “as a service” (LLMaaS) platforms for governance and scalability. Among many features, data governance (primarily for unstructured text) will be a critical offering of these platforms, including that from Blattner Technologies. This initiative will focus on contributing to the development of an extensible end-to-end data governance framework, including external data ingestion, parallelized data preparation and analytics, and versioning.

 


1.2      Desired Outcomes

-   Prototype innovative workflow-based capabilities for preparing unstructured text in a scalable, traceable, and intuitive manner for downstream LLM-related tasks, such as training and fine-tuning.

-   Presentation to broader company highlighting approach, challenges, solutions, and significant insights stemming from the effort.

 

1.3      Core Skills Required

-   Required skills:

o   Fundamental LLM knowledge (e.g., prompt engineering, fine-tuning)

o   NLP-based development (e.g., tokenization, embedding generation, and operations, textfication)

o   Python development

o   Experience with parallel distributed systems and/or parallel computation libraries such as Spark, Dask, or RAPIDS

-   Optional/preferable skills:

o   Kubeflow

o   Vector databases

o   Experience with NLP libraries such as spaCy and gensim

 


1.4      Estimated Effort

-   Full-time summer internship (40 hours/week)

-   Depending on progress, work may extend to part-time during the Fall semester (e.g., 10 hours/week)

 


1.5      Additional Information

This is a remote internship opportunity, working with summer mentors and reporting to the Chief Product Officer of BOSS AI. The group has a deep focus on implementing LLMs “as a service” (LLMaaS) and team members have a range of skills from enterprise software engineering, NLP, ML, and UX. You can expect to gain valuable experience in operationalizing LLMs and addressing critical security needs for all language models.



  • Should be {http://www.linkedin.com/pub/[member-name/]x/y/z} or {http://www.linkedin.com/in/string}