Dataset path mismatch #143

Open
liudonglei opened this issue Aug 18, 2018 · 2 comments
@liudonglei

As of 2018-08-18, the data path used in Castor/sm_cnn/create_dataset.sh, such as "../../Castor-data/TrecQA", does not match the actual path in the Castor-data directory.

Could you please check it?

@Victor0118 changed the title from "the" to "Dataset path mismatch" on Sep 13, 2018
@Victor0118
Member

@liudonglei You are right. The current SMCNN code needs refactoring; see #128. You are welcome to open a PR to contribute!

@liudonglei
Author

liudonglei commented Sep 27, 2018

@Victor0118 The dataset path mismatch can be worked around by manually copying the trecqa and wikiqa directories under Castor-data/datasets up to the parent directory.
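
The manual copy described above could be scripted roughly as follows. This is only a minimal sketch: the helper name `copy_datasets_up` is hypothetical (it is not part of Castor), and it assumes the layout mentioned in this thread, i.e. that `trecqa` and `wikiqa` sit under `Castor-data/datasets`. It does not touch the paths inside create_dataset.sh.

```python
import shutil
from pathlib import Path


def copy_datasets_up(castor_data_root, names=("trecqa", "wikiqa")):
    """Copy <root>/datasets/<name> to <root>/<name> for each name.

    Hypothetical helper, not part of Castor. Returns the list of
    destination directories that were actually written.
    """
    root = Path(castor_data_root)
    copied = []
    for name in names:
        src = root / "datasets" / name
        dst = root / name
        if not src.is_dir():
            # Assumed layout not found; skip instead of failing.
            continue
        # dirs_exist_ok requires Python 3.8 or newer.
        shutil.copytree(src, dst, dirs_exist_ok=True)
        copied.append(dst)
    return copied
```

Calling `copy_datasets_up("../../Castor-data")` from Castor/sm_cnn would mirror the manual step. Copying rather than moving leaves the original `datasets` directory intact.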
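However, I am now stuck on another problem; see #142 for details.
I know nothing about the torchtext package, so I cannot work out where this <pad> token comes from or how to fix it. Could the authors please take a look?
Command executed: $ python train.py --mode static --no_cuda
Error output: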
File "train.py", line 62, in
postprocessing=data.Pipeline(lambda arr, _, train: [float(y) for y in arr]))
File "train.py", line 62, in
postprocessing=data.Pipeline(lambda arr, _, train: [float(y) for y in arr]))
ValueError: could not convert string to float: ' < pad > '
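
The traceback shows that the float conversion in the postprocessing lambda received a padding string. The sketch below reproduces that failure in plain Python, without torchtext, and shows one possible guard. It is an illustration under assumptions, not the maintainers' fix: the function names and the choice of 0.0 as the padding value are hypothetical, and "<pad>" is assumed to be the padding token.

```python
PAD_TOKEN = "<pad>"  # assumed padding token


def to_floats_strict(arr):
    # Same conversion as the lambda in the traceback: fails on any
    # element that is not a numeric string, including a pad token.
    return [float(y) for y in arr]


def to_floats_pad_safe(arr, pad_token=PAD_TOKEN, pad_value=0.0):
    # Hypothetical guard: map pad tokens to a numeric placeholder
    # and convert everything else as before.
    return [pad_value if y.strip() == pad_token else float(y) for y in arr]
```

With `["0.5", "<pad>"]` as input, `to_floats_strict` raises the same kind of ValueError as above, while `to_floats_pad_safe` returns a list of floats. Whether 0.0 is a sensible placeholder depends on how the model uses these features.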
