
Conversation

@wrapss (Contributor) commented Nov 23, 2023

  • Introduce 'maxLines' and 'maxTokens' settings to give finer control over the inference engine
  • Introduce a 'temperature' setting to control how creative the model's responses are (lower values work better for coding; Ollama defaults to 0.8). A sketch of reading these settings is shown below.
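A minimal sketch of how these settings might be read, assuming the 'inference' configuration section used by the provider below; the fallback values are illustrative, not the extension's actual defaults:

import vscode from 'vscode';

let config = vscode.workspace.getConfiguration('inference');
// Illustrative fallbacks only.
let maxLines = config.get<number>('maxLines', 16);         // cap on lines kept from a completion
let maxTokens = config.get<number>('maxTokens', 256);      // cap on generated tokens per request
let temperature = config.get<number>('temperature', 0.2);  // lower values give more deterministic output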

@wrapss changed the title from "✨feat: Add maxLines and maxTokens settings" to "✨feat: Add maxLines, maxTokens and temperature settings" on Nov 23, 2023
@ex3ndr merged commit 0dcffcb into ex3ndr:main on Nov 24, 2023
@ex3ndr (Owner) commented Nov 24, 2023

Perfect! Thanks!


@Ipang2008 left a comment


import vscode from 'vscode';
import { info, warn } from '../modules/log';
import { autocomplete } from './autocomplete';
import { preparePrompt } from './preparePrompt';
import { AsyncLock } from '../modules/lock';
import { getFromPromptCache, setPromptToCache } from './promptCache';
import { isNotNeeded, isSupported } from './filter';
import { ollamaCheckModel } from '../modules/ollamaCheckModel';
import { ollamaDownloadModel } from '../modules/ollamaDownloadModel';

export class PromptProvider implements vscode.InlineCompletionItemProvider {

lock = new AsyncLock();
statusbar: vscode.StatusBarItem;

constructor(statusbar: vscode.StatusBarItem) {
    this.statusbar = statusbar;
}

async provideInlineCompletionItems(document: vscode.TextDocument, position: vscode.Position, context: vscode.InlineCompletionContext, token: vscode.CancellationToken): Promise<vscode.InlineCompletionItem[] | vscode.InlineCompletionList | undefined | null> {

    try {

        // Ignore unsupported documents
        if (!isSupported(document)) {
            info(`Unsupported document: ${document.uri.toString()} ignored.`);
            return;
        }

        // Ignore if not needed
        if (isNotNeeded(document, position)) {
            info('No inline completion required');
            return;
        }

        // Ignore if already canceled
        if (token.isCancellationRequested) {
            info(`Canceled before AI completion.`);
            return;
        }

        // Execute in lock
        return await this.lock.inLock(async () => {

            // Prepare context
            let prepared = await preparePrompt(document, position, context);
            if (token.isCancellationRequested) {
                info(`Canceled before AI completion.`);
                return;
            }

            // Result
            let res: string | null = null;

            // Check if in cache
            let cached = getFromPromptCache({
                prefix: prepared.prefix,
                suffix: prepared.suffix
            });

            // If not cached
            if (cached === undefined) {

                // Config
                let config = vscode.workspace.getConfiguration('inference');
                let endpoint = config.get('endpoint') as string;
                let model = config.get('model') as string;
                let maxLines = config.get('maxLines') as number;
                let maxTokens = config.get('maxTokens') as number;
                let temperature = config.get('temperature') as number;
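                // Normalize the endpoint: drop a trailing slash before building request URLs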
                if (endpoint.endsWith('/')) {
                    endpoint = endpoint.slice(0, endpoint.length - 1);
                }

                // Update status
                this.statusbar.text = `$(sync~spin) Llama Coder`;
                try {

                    // Check model exists
                    let modelExists = await ollamaCheckModel(endpoint, model);
                    if (token.isCancellationRequested) {
                        info(`Canceled after AI completion.`);
                        return;
                    }

                    // Download model if not exists
                    if (!modelExists) {
                        this.statusbar.text = `$(sync~spin) Downloading`;
                        await ollamaDownloadModel(endpoint, model);
                        this.statusbar.text = `$(sync~spin) Llama Coder`;
                    }
                    if (token.isCancellationRequested) {
                        info(`Canceled after AI completion.`);
                        return;
                    }

                    // Run AI completion
                    info(`Running AI completion...`);
                    res = await autocomplete({
                        prefix: prepared.prefix,
                        suffix: prepared.suffix,
                        endpoint: endpoint,
                        model: model,
                        maxLines: maxLines,
                        maxTokens: maxTokens,
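                        // shorthand property: equivalent to 'temperature: temperature'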
                        temperature,
                        canceled: () => token.isCancellationRequested,
                    });
                    info(`AI completion completed: ${res}`);

                    // Put to cache
                    setPromptToCache({
                        prefix: prepared.prefix,
                        suffix: prepared.suffix,
                        value: res
                    });
                } finally {
                    this.statusbar.text = `$(chip) Llama Coder`;
                }
            } else {
                if (cached !== null) {
                    res = cached;
                }
            }
            if (token.isCancellationRequested) {
                info(`Canceled after AI completion.`);
                return;
            }

            // Return result
            if (res && res.trim() !== '') {
                return [{
                    insertText: res,
                    range: new vscode.Range(position, position),
                }];
            }

            // Nothing to complete
            return;
        });
    } catch (e) {
        warn('Error during inference:', e);
    }
}

}

@wrapss (Contributor, Author) commented Nov 28, 2023


If you mean that the explicit "temperature:" key is missing, good catch. TypeScript handles the shorthand just as well, but I'll change it later today for readability.
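For reference, a minimal sketch of the equivalence (the value is illustrative):

const temperature = 0.2; // illustrative value
// These two object literals are identical:
const shorthand = { temperature };
const explicit = { temperature: temperature };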
