Improve function call (#279) #393

forever-ly · 2023-11-28T06:57:57Z

Description

The current function calling implementation parses docstrings into a JSON schema, which makes it sensitive to the docstring style used. A promising way to improve it is to use pydantic, as instructor does (close [Feature Request] Improve function calling implementation #279).
The current function calling implementation does not support Enum. For example, the get_weather_data tool may not work as expected, since input arguments such as time_units should be enum types: #334. Without enum support, the agent may hallucinate an invalid input (close [Feature Request] Support Enum type in function calling #360).
Support the new function call format (the functions parameter in client.chat.completions.create has been deprecated in favor of the tools parameter, with calls returned as tool_calls, since openai version >= 1.2.3).
Support user-defined JSON descriptions of functions/tools (e.g. read from a file).

Motivation and Context

The openai function/tool call feature requires a JSON schema description of the function/tool. The current implementation in camel uses camel.utils.commons.parse_doc to parse Google-style docstrings. This has the following issues:

It does not support tool calls. (The functions parameter in client.chat.completions.create has been deprecated in favor of the tools parameter, with calls returned as tool_calls, since openai version >= 1.2.3.)
It only supports Google-style docstrings, which is a problem when users prefer other docstring styles or when functions come from third-party libraries. The current code (shown below) handles only Google-style parsing.

def parse_doc(func: Callable) -> Dict[str, Any]:
    ...
    if args_section:
        args_descs: List[Tuple[str, str, str]] = re.findall(
            r'(\w+)\s*\((\w+)\):\s*(.*)', args_section)
        properties = {
            name.strip(): {
                'type': type,
                'description': desc
            }
            for name, type, desc in args_descs
        }
        for name in properties:
            required.append(name)
    ...
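
To make the limitation concrete, here is a small check of the same regular expression against three argument sections. The sample docstring fragments and variable names are made up for illustration; only the pattern itself comes from the code above.

```python
import re

# The pattern used by parse_doc to extract "name (type): description".
ARGS_PATTERN = r'(\w+)\s*\((\w+)\):\s*(.*)'

# Google-style arguments with JSON schema type names.
google_args = """
    a (integer): the description of "a"
    b (string): the description of "b"
"""

# NumPy-style arguments: no parentheses, so nothing matches.
numpy_args = """
    a : int
        the description of "a"
    b : str
        the description of "b"
"""

# Google-style, but the type contains brackets, which \w+ cannot match.
generic_args = """
    values (List[int]): a list of numbers
"""

google_matches = re.findall(ARGS_PATTERN, google_args)
numpy_matches = re.findall(ARGS_PATTERN, numpy_args)
generic_matches = re.findall(ARGS_PATTERN, generic_args)

print(google_matches)   # two (name, type, description) tuples
print(numpy_matches)    # empty list
print(generic_matches)  # empty list
```

Parameters that the pattern misses are silently dropped from both properties and required, so the resulting schema can be incomplete without any error being raised.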

The solution is to use docstring_parser, a library that parses several docstring styles, including ReST, Google, Numpydoc and Epydoc.

There are also problems with type parsing. The current parse_doc takes parameter types from the docstring, so docstrings have to use JSON schema datatypes instead of Python datatypes, and types such as Enum cannot be supported this way. For example, the docstring currently has to be written as:

    """Adds two numbers.
    Args:
        a (integer): the description of "a"
        b (string): the description of "b"
    """

If we write it in the following form, the parsed types will be int and str, which do not match the JSON schema types that the openai function call accepts:

    """Adds two numbers.
    Args:
        a (int): the description of "a"
        b (str): the description of "b"
    """

Therefore, following the corresponding implementations in the llama_index and instructor libraries, the inspect module is used to extract type information from the function signature instead of the docstring, and pydantic then converts the Python types to JSON schema types. This implementation can also support more complex data types, such as Enum and datetime.
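
A minimal sketch of the signature-based idea, using only the standard library. The helper names annotation_to_schema and signature_to_parameters, the Unit enum and get_temperature are hypothetical, and the hand-written type mapping is a simplified stand-in for the conversion that the PR delegates to pydantic. It is not the code in this PR.

```python
import inspect
from enum import Enum
from typing import Any, Callable, Dict, List, get_args, get_origin, get_type_hints

# Hypothetical hand-rolled mapping; the PR leaves this step to pydantic.
_PRIMITIVES = {str: "string", int: "integer", float: "number", bool: "boolean"}


def annotation_to_schema(tp: Any) -> Dict[str, Any]:
    """Convert a small subset of Python annotations to JSON schema."""
    if tp in _PRIMITIVES:
        return {"type": _PRIMITIVES[tp]}
    if inspect.isclass(tp) and issubclass(tp, Enum):
        return {"enum": [member.value for member in tp]}
    if get_origin(tp) is list:
        (item_type,) = get_args(tp)
        return {"type": "array", "items": annotation_to_schema(item_type)}
    raise TypeError(f"unsupported annotation: {tp!r}")


def signature_to_parameters(func: Callable) -> Dict[str, Any]:
    """Build a 'parameters' object from the function signature alone."""
    hints = get_type_hints(func)
    properties: Dict[str, Any] = {}
    required: List[str] = []
    for name, param in inspect.signature(func).parameters.items():
        schema = annotation_to_schema(hints[name])
        if param.default is inspect.Parameter.empty:
            # No default value: the caller must supply this argument.
            required.append(name)
        else:
            default = param.default
            schema["default"] = default.value if isinstance(default, Enum) else default
        properties[name] = schema
    return {"type": "object", "properties": properties, "required": required}


class Unit(Enum):
    CELSIUS = "celsius"
    FAHRENHEIT = "fahrenheit"


def get_temperature(city: str, days: List[int], unit: Unit = Unit.CELSIUS) -> float:
    return 0.0


params = signature_to_parameters(get_temperature)
print(params)
```

Because the types come from the signature, the docstring is free to use any style and only needs to supply descriptions.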

In conclusion, I defined get_openai_function_schema and get_openai_tool_schema to replace the parse_doc function. They:

Support the new tool call format.
Automatically convert Python parameter types to JSON schema.
Support data types such as enums, as well as default parameters.
For example, for the following function:

def test_all_parameters(
        str_para: str,
        int_para: int,
        list_para: List[int],
        float_para: float,
        datatime_para: datetime,
        default_enum_para: RoleType = RoleType.CRITIC,
):
    """
    A function to test all parameter type. The parameters will be provided by user.
    Args:
        str_para (str):
        str_para: str_para desc
        int_para (int): int_para desc
        list_para (List): list_para desc
        float_para (float): float_para desc
        datatime_para (datetime): datatime_para desc
        default_enum_para (RoleType): default_enum_para desc
    """

The result of parse_doc is:

{
'name': 'test_all_parameters',
'description': 'A function to test all parameter type. The parameters will be provided by user.',
'parameters': {
	'type': 'object',
	'properties': {
	'str_para': {'type': 'str','description': 'str_para: str_para desc  '},
	'int_para': {'type': 'int', 'description': 'int_para desc'},
	'list_para': {'type': 'List', 'description': 'list_para desc'},
	'float_para': {'type': 'float', 'description': 'float_para desc'},
	'datatime_para': {'type': 'datetime', 'description': 'datatime_para desc'},
	'default_enum_para': {'type': 'RoleType','description': 'default_enum_para desc'}
	},
'required': ['str_para','int_para','list_para','float_para','datatime_para','default_enum_para']}}

The result of get_openai_function_schema is:

{
'name': 'test_all_parameters',
'description': 'A function to test all parameter type. The parameters will be provided by user.',
'parameters': {
	'$defs': {'RoleType': {'enum': ['assistant','user','critic','embodiment','default'],'type': 'string'}},
	'properties': {
		'str_para': {'type': 'string'},
		'int_para': {'type': 'integer'},
		'list_para': {'items': {'type': 'integer'}, 'type': 'array'},
		'float_para': {'type': 'number'},
		'datatime_para': {'format': 'date-time', 'type': 'string'},
		'default_enum_para': {'allOf': [{'$ref': '#/$defs/RoleType'}], 'default': 'critic'}
		},
	'required': ['str_para','int_para','list_para','float_para','datatime_para'],
	'type': 'object'
	}
}

The result of get_openai_tool_schema is:

{
'type': 'function',
 'function': {
	'name': 'test_all_parameters',
	'description': 'A function to test all parameter type. The parameters will be provided by user.',
	'parameters': {
		'$defs': {'RoleType': {'enum': ['assistant','user','critic','embodiment','default'],'type': 'string'}},
		'properties': {
			'str_para': {'type': 'string'},
			'int_para': {'type': 'integer'},
			'list_para': {'items': {'type': 'integer'}, 'type': 'array'},
			'float_para': {'type': 'number'},
			'datatime_para': {'format': 'date-time', 'type': 'string'},
			'default_enum_para': {'allOf': [{'$ref': '#/$defs/RoleType'}], 'default': 'critic'}
			},
		'required': ['str_para','int_para','list_para','float_para','datatime_para'],
		'type': 'object'
		}
	}

 }
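
Comparing the two results above, the tool schema is the function schema wrapped in an envelope with 'type': 'function'. A minimal sketch of that relationship follows; wrap_as_tool and the sample add schema are hypothetical names used for illustration, not functions from this PR.

```python
from typing import Any, Dict


def wrap_as_tool(function_schema: Dict[str, Any]) -> Dict[str, Any]:
    """Wrap a function schema in the envelope used by the tool call format."""
    return {"type": "function", "function": function_schema}


# A small hand-written function schema in the older "functions" format.
function_schema = {
    "name": "add",
    "description": "Adds two numbers.",
    "parameters": {
        "type": "object",
        "properties": {"a": {"type": "integer"}, "b": {"type": "integer"}},
        "required": ["a", "b"],
    },
}

tool_schema = wrap_as_tool(function_schema)
print(tool_schema["type"])              # function
print(tool_schema["function"]["name"])  # add
```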

In addition, the function call module needs to:

Accept a user-defined function call JSON description (e.g. read from a file)
Allow parts of the JSON description to be modified
Verify that the JSON description is valid
Therefore, I have modified class OpenAIFunction to provide the above functionality:
validate_openai_tool_schema: validates the format of the tool schema against the JSON schema specification
get_openai_tool_schema; set_openai_tool_schema
get_openai_function_schema; set_openai_function_schema
get_function_name; set_function_name
get_function_description; set_function_description
get_paramter_description; set_paramter_description
get_parameter; set_parameter
parameters
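
As a rough illustration of the load, modify and validate workflow listed above, here is a standard-library sketch. The function check_tool_schema and the rules it enforces are assumptions chosen for this example; they are not the validation performed by validate_openai_tool_schema in this PR.

```python
import json
from typing import Any, Dict


def check_tool_schema(schema: Dict[str, Any]) -> None:
    """Raise ValueError if the schema lacks the basic tool call structure."""
    if schema.get("type") != "function":
        raise ValueError("top-level 'type' must be 'function'")
    function = schema.get("function")
    if not isinstance(function, dict):
        raise ValueError("'function' must be an object")
    # Keys required by this sketch; a real validator may be stricter or looser.
    for key in ("name", "parameters"):
        if key not in function:
            raise ValueError(f"'function' is missing '{key}'")
    parameters = function["parameters"]
    if parameters.get("type") != "object":
        raise ValueError("'parameters.type' must be 'object'")
    unknown = set(parameters.get("required", [])) - set(parameters.get("properties", {}))
    if unknown:
        raise ValueError(f"required names not in properties: {sorted(unknown)}")


# Stand-in for a description read from a file with json.load(...).
raw = """
{
  "type": "function",
  "function": {
    "name": "add",
    "description": "Adds two numbers.",
    "parameters": {
      "type": "object",
      "properties": {"a": {"type": "integer"}, "b": {"type": "integer"}},
      "required": ["a", "b"]
    }
  }
}
"""

schema = json.loads(raw)
check_tool_schema(schema)

# Modify one part of the description, then validate again.
schema["function"]["description"] = "Adds two integers."
check_tool_schema(schema)
```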

Types of changes

What types of changes does your code introduce? Put an x in all the boxes that apply:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)
Example (update in the folder of example)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

I have read the CONTRIBUTION guide. (required)
My change requires a change to the documentation.
I have updated the tests accordingly. (required for a bug fix or a new feature)
I have updated the documentation accordingly.

dandansamax

Awesome! It makes the feature much easier to use. Thanks a lot. I left some minor suggestions to improve the readability of the code.

camel/utils/commons.py

test/utils/test_get_openai_tool_schema.py

camel/functions/openai_function.py

test/utils/test_get_openai_tool_schema.py

dandansamax

Sorry for the late review. Your code looks awesome; let's wait for @lightaime to have a look. In the meantime, could you add an example showing how to add a custom function? I believe it will be useful.

dandansamax · 2023-12-13T09:40:45Z

Tests failed because pydantic 2.x conflicts with argilla. We are solving it.

Billy1900

the test function is good and comprehensive.

Billy1900 · 2024-01-13T18:52:49Z

test/utils/test_get_openai_tool_schema.py

+        """
+        return a * b
+
+    expect_res = json.loads("""{"type": "function",


JSON could be formatted here.

lightaime · 2024-01-14T04:04:03Z

Thanks @forever-ly for this very helpful PR!! This is awesome. Also thanks everyone for the review. I will go ahead and merge it. Since it is a huge and helpful PR, please feel free to review it again and let me know if there are any other issues.

Wendong-Fan

High-quality code!

Billy1900

clean code and its structure.

yiyiyi0817 · 2024-03-29T14:03:07Z

Hello, @forever-ly. I seem to have encountered a bug and I'm curious if get_openai_tool_schema can parse data of type Tuple[float, float]. I have a function defined as follows: def get_elevation(lat_lng: Tuple[float, float]) -> str:. However, this leads to an error: openai.BadRequestError: Error code: 400 - {'error': {'message': "Invalid schema for function 'get_elevation': In context=('properties', 'lat_lng'), array schema missing items", 'type': 'invalid_request_error', 'param': None, 'code': None}}. Interestingly, when I do not specify the float type, i.e., def get_elevation(lat_lng: Tuple) -> str:, the issue does not occur.

Then I found the output of get_openai_tool_schema for def get_elevation(lat_lng: Tuple[float, float]) -> str:
{
  "type": "function",
  "function": {
    "name": "get_elevation",
    "description": "Retrieves elevation data for a given latitude and longitude.\nUses the Google Maps API to fetch elevation data for the specified latitude\nand longitude. It handles exceptions gracefully and returns a description\nof the elevation, including its value in meters and the data resolution.",
    "parameters": {
      "properties": {
        "lat_lng": {
          "maxItems": 2,
          "minItems": 2,
          "prefixItems": [{ "type": "number" }, { "type": "number" }],
          "type": "array",
          "description": "The latitude and longitude for\nwhich to retrieve elevation data."
        }
      },
      "required": ["lat_lng"],
      "type": "object"
    }
  }
}
It has a key named "prefixItems" instead of "items".

But for def get_elevation(lat_lng: Tuple) -> str: the "items" key exists, as follows:
{
  "type": "function",
  "function": {
    "name": "get_elevation",
    "description": "Retrieves elevation data for a given latitude and longitude.\nUses the Google Maps API to fetch elevation data for the specified latitude\nand longitude. It handles exceptions gracefully and returns a description\nof the elevation, including its value in meters and the data resolution.",
    "parameters": {
      "properties": {
        "lat_lng": {
          "items": {},
          "type": "array",
          "description": "The latitude and longitude for\nwhich to retrieve elevation data."
        }
      },
      "required": ["lat_lng"],
      "type": "object"
    }
  }
}

I suspect that this issue might be due to OpenAI not accepting the "prefixItems" key, which is produced by the get_openai_tool_schema function. I'm not entirely sure if my analysis is correct, and I would greatly appreciate your insights on this matter.
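
One possible workaround, sketched under the assumption (taken only from the error message above) that the API wants an "items" key on every array schema: post-process the generated schema and replace "prefixItems" with a single "items" entry. The helper replace_prefix_items is hypothetical and is not part of camel or pydantic; whether the resulting schema is fully accepted by the API has not been verified here.

```python
from typing import Any


def replace_prefix_items(schema: Any) -> Any:
    """Return a copy in which tuple-style array schemas use 'items' instead."""
    if isinstance(schema, list):
        return [replace_prefix_items(entry) for entry in schema]
    if not isinstance(schema, dict):
        return schema
    result = {key: replace_prefix_items(value) for key, value in schema.items()}
    if result.get("type") == "array" and isinstance(result.get("prefixItems"), list):
        prefix = result.pop("prefixItems")
        if "items" not in result:
            # Reuse the element schema when every position agrees, else allow any.
            first = prefix[0] if prefix else {}
            result["items"] = first if all(p == first for p in prefix) else {}
    return result


# The lat_lng property from the schema shown above, without its description.
lat_lng = {
    "maxItems": 2,
    "minItems": 2,
    "prefixItems": [{"type": "number"}, {"type": "number"}],
    "type": "array",
}

fixed = replace_prefix_items(lat_lng)
print(fixed)
```

The minItems and maxItems bounds are kept, so the fixed length of the tuple is still expressed. Per-position types are lost when the positions differ, in which case the sketch falls back to an unconstrained "items".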

Improve function call (#279)

edeb8de

forever-ly requested a review from dandansamax November 28, 2023 06:58

forever-ly and others added 6 commits November 28, 2023 16:01

Improve function call (#279)

4a4b0a0

Merge branch 'master' into improve_function_call

c47278c

Merge remote-tracking branch 'origin/improve_function_call' into impr…

7436add

…ove_function_call

Merge remote-tracking branch 'origin/improve_function_call' into impr…

7ef776b

…ove_function_call fixed some erros in "improve function call"

fixed typo mistake in common.py

014acff

Fix some format issues

cb4d4a6

dandansamax requested changes Nov 28, 2023

Changes to meet requirements

1c1fdf0

dandansamax approved these changes Dec 13, 2023

Merge remote-tracking branch 'origin/master' into improve_function_call

e0541a9

lightaime assigned forever-ly Jan 13, 2024

lightaime added the FunctionCall label Jan 13, 2024

Wendong-Fan mentioned this pull request Jan 13, 2024

Improve function calling (close #279) #308

Closed

9 tasks

Wendong-Fan requested review from Billy1900, FUYICC and Wendong-Fan January 13, 2024 15:34

lightaime and others added 4 commits January 13, 2024 20:07

Merge branch 'master' into improve_function_call

5fe18ad

minor change

d2f65a6

small fix

52ae7e0

small fix

5f94d73

Billy1900 requested changes Jan 13, 2024

lightaime added 5 commits January 14, 2024 05:22

Add tests and small format fix

dffe8b9

Add more tests

e1f0b8a

Move get schema functions, fix inconsistent docstring, add pydantic t…

1a534d9

…o pyproject

small fix

dd405ae

Added Literal type to functions

38b929b

lightaime merged commit 067b558 into master Jan 14, 2024
6 checks passed

lightaime deleted the improve_function_call branch January 14, 2024 04:05

Wendong-Fan reviewed Jan 14, 2024

Billy1900 reviewed Jan 14, 2024

ocss884 mentioned this pull request Mar 18, 2024

[Bug] ToolAssistantToolsFunction can't find #475

Merged

13 tasks

yiyiyi0817 mentioned this pull request Mar 31, 2024

[BUG] get_openai_tool_schema function not be able to parse the Tuple[float, float] type annotation #495

Closed

3 tasks
