Could not reproduce the reported result with official models #7

Open
zyzhang1130 opened this issue May 31, 2024 · 0 comments
zyzhang1130 commented May 31, 2024

Hi, can you confirm that the models available on Hugging Face (https://huggingface.co/shenwzh3/alpha-umi-planner-13b, etc.) are the same ones used to produce the results in your paper? I ran the examples from Fig. 6, Fig. 7, Table 5, and Table 6 with greedy decoding, but both the 13b and 7b models give much worse results than those reported. I'm not sure what is going wrong.
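For anyone trying to reproduce this, here is a minimal sketch of what greedy decoding with Hugging Face `transformers` typically looks like. It is an illustration under assumptions, not necessarily the exact script behind the output below: the helper names `greedy_generation_kwargs` and `generate_greedy` are hypothetical, the prompt is passed through as-is, and the repository's own prompt template is not reproduced here.

```python
def greedy_generation_kwargs(max_new_tokens=512):
    """Keyword arguments that make `generate` pick the argmax token at each step."""
    return {
        "do_sample": False,   # no sampling
        "num_beams": 1,       # no beam search, so plain greedy decoding
        "max_new_tokens": max_new_tokens,
    }


def generate_greedy(model_id, prompt, max_new_tokens=512):
    """Load a causal LM from the Hub and decode `prompt` greedily.

    `prompt` is assumed to be already formatted; the template the model
    was trained with is not reconstructed in this sketch.
    """
    # Imported lazily so the helper above can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, **greedy_generation_kwargs(max_new_tokens))
    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Usage would be along the lines of `generate_greedy("shenwzh3/alpha-umi-planner-13b", prompt)`. If the prompt format or special tokens differ from what the model saw during training, that alone could plausibly explain degraded output, so the exact template is worth checking first.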

Example:

User: I want to explore the catalog of products available in the ’Electronics’ category. Please provide me with the list of products along with their details.

planner_agent: Based on the previous action, it seems that the function "catalog_details_for_product_catalog" is not valid. To explore the catalog of products available in the 'Electronics' category, I can try using the function "catalog_details_for_ " +
    "category_name = 'Electronics' +
    "limit = 10 +
    "fields = 'product_name, product_details, product_price, product_description, product_image, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image

caller_agent: I will call the function 'catalog_details_for_' next step.
Action: catalog_details_for_
Action Input: {
  "category_name": "Electronics",
  "limit": 10,
  "fields": "product_name, product_details, product_price, product_description, product_image, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_image_url, product_
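The looping on `product_image_url` in both outputs above can be quantified with a small check. This is only a rough sketch for flagging degenerate generations when comparing runs; the function names and the thresholds are my own assumptions and are not part of the project's code.

```python
from collections import Counter


def repetition_stats(text, sep=","):
    """Return (most_common_item, count, fraction) over the sep-separated items of text."""
    items = [part.strip() for part in text.split(sep) if part.strip()]
    if not items:
        return None, 0, 0.0
    item, count = Counter(items).most_common(1)[0]
    return item, count, count / len(items)


def looks_degenerate(text, sep=",", min_repeats=5, threshold=0.5):
    """Flag text where a single item repeats often and dominates the list.

    min_repeats and threshold are arbitrary illustrative values.
    """
    _, count, fraction = repetition_stats(text, sep)
    return count >= min_repeats and fraction >= threshold


if __name__ == "__main__":
    # Shortened stand-in for the "fields" value shown above.
    fields = "product_name, product_details, " + ", ".join(["product_image_url"] * 20)
    print(repetition_stats(fields))
    print(looks_degenerate(fields))
```

Running the same check on outputs from different checkpoints or prompt formats gives a quick signal for whether a change removes the looping.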