Learning a sequence of $X$ simultaneously under the E2C framework leads to low loss values, but the space is no longer well-formed.
Learning a sequence of $\bar{X}$ conditioned on just $X_0$ performs even more poorly (further investigation needed):
No comments:
Post a Comment