Following up from #57.
Against datahub docker quickstart sample data, ask:
tell me about SampleHiveDataset
The dataset is there (datahub get --urn 'urn:li:dataset:(urn:li:dataPlatform:hive,SampleHiveDataset,PROD)' returns it), but it has no description / glossary term / domain. The agent runs search_business_context, all four sub-searches come back empty, and it tells the user the dataset "doesn't exist in the catalog."
The SKILL.md for search_business_context actually says to fall through to search + get_entities when nothing is found, and both tools are wired in via build_langchain_tools. I tried calling search(query="SampleHiveDataset", type=DATASET) against quickstart GMS directly and it does return the dataset — so the fall-through path would work, the agent just isn't taking it when the empty result comes from the business-context fan-out.
Haven't dug into why the LLM stops there — could be a prompting thing, could be something else. Happy to look closer if useful.
Following up from #57.
Against
datahub docker quickstartsample data, ask:The dataset is there (
datahub get --urn 'urn:li:dataset:(urn:li:dataPlatform:hive,SampleHiveDataset,PROD)'returns it), but it has no description / glossary term / domain. The agent runssearch_business_context, all four sub-searches come back empty, and it tells the user the dataset "doesn't exist in the catalog."The SKILL.md for
search_business_contextactually says to fall through tosearch+get_entitieswhen nothing is found, and both tools are wired in viabuild_langchain_tools. I tried callingsearch(query="SampleHiveDataset", type=DATASET)against quickstart GMS directly and it does return the dataset — so the fall-through path would work, the agent just isn't taking it when the empty result comes from the business-context fan-out.Haven't dug into why the LLM stops there — could be a prompting thing, could be something else. Happy to look closer if useful.